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Identification and Expression of Insect Steroid 
- -Receptor DNA-Sequences 

This invention was made in part with U.S. government 
5 support under Grant DCB 8405370 from the National Science 

Foundation. The U.S. government may have certain rights 
in this invention. 

FIELD OF THE INVENTION 
10 This invention relates generally to the use of 

recombinant DNA methods as applied to the nucleic acid 
sequences and polypeptides characteristic of insect 
steroid receptor superfamily members and, more 
particularly, to uses of such receptors and the DNA 
15 regulatory elements associated with genes whose 

expression they regulate for the production of proteins 
in cultured cells, and to uses of such hormone receptor 
proteins and genes in identifying new hormones that 
control insect development. 

20 

BACKGROUND OF THE INVENTION 
The temporal sequence of gene expression determines 
the nature and sequence of steps in the development of 
the adult animal from the fertilized egg. The common 

25 fruit fly, Drosophila melanocraster . provides a favorable 

model system for studying this genetic control of 
development. Various aspects of Drosophila development 
are representative of general insect and, in many 
respects, vertebrate development. 

30 The steroid hormone 20-OH ecdysone, also known as 

B-ecdysone, controls timing of development in many 
insects. See, generally , Koolman (ed.), Ecdysone; From 
Chemistry to Mode of Action . Thieme Medical Pub., N.Y. 
(1989) , which is hereby incorporated herein by reference. 

35 The generic term "ecdysone" is frequently used as an 

abbreviation for 20-OH ecdysone. Pulses, or rises and 
falls, of the ecdysone concentration over a short period 
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of time in insect development are observed at various 
stages of Drosophila development . 

These stages include embryogenesis, three larval 
stages and two pupal stages. The last pupal stage ends 
with the formation of the adult fly. An ecdysone pulse 
at the end of the third, or last, larval stage pulse 
triggers the beginning of the metamorphosis of the larva 
to the adult fly. Certain tissues, called imaginal 
tissues, are induced to begin their formation of adult 
structures such as eyes, wings, and legs. 

During the larval stages of development, giant 
polytene chromosomes develop in non-imaginal larval 
tissues. These cable-like chromosomes consist of 
aggregates comprising up to about 2,000 chromosomal 
copies. These chromosome aggregates are extremely useful 
because they provide a means whereby the position of a 
given gene within a chromosome can be determined to a 
very high degree of resolution, several orders of 
magnitude higher than is typically possible for normal 
chromosomes . 

A "puff" in the polytene chromosomes is a localized 
expansion or swelling of these cable-like polytene 
chromosome aggregates that is associated with the 
transcription of a gene at the puff locus. A puff is, 
therefore, an indicator of the transcription of a gene 
located at a particular position in the chromosome. 

A genetic regulatory model was proposed to explain 
the temporal sequence of polytene puffs induced by the 
ecdysone pulse which triggers the larval -to-adult 
metamorphosis. See , Ashburner et al., "On the Temporal 
Control of Puffing Activity in Polytene Chromosomes," 
Cold Spr ing Harbor Svmn. Quant. Biol. 38:655-663 (1974). 
This model proposed that ecdysone interacts reversibly 
with a receptor protein, the ecdysone receptor, to form 
an ecdysone-receptor complex. This complex would 
directly induce the transcription of a small set of 
"early" genes responsible for a half dozen immediately 
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induced "early" puffs. These early genes are postulated 
to encode regulatory proteins that "induce the 
transcription of a second set of "late" genes responsible 
for the formation of the "late" puffs. The model thus 
defines a genetic regulatory hierarchy of three ranks, 
the ecdysone-receptor gene in the first rank, the early 
genes in the second rank, and the late genes in the 
third. While this model was derived from the puffing 
pattern observed in a non-imaginal tissue, similar 
genetic regulatory hierarchies may also determine the 
metamorphic changes in development of imaginal tissues 
that are also targets of ecdysone, as well as the changes 
in tissue development induced by the pulses of ecdysone 
that occur at other developmental stages. 

Various structural data have been derived from 
receptors for vertebrate steroids and other lipophilic 
receptor proteins. A "super family" of such receptors has 
been defined on the basis of their structural 
similarities. See, Evans, "The Steroid and Thyroid 
Hormone Receptor Superfamily," Science 240:889-895 
(1988) ; Green and Chambon, "Nuclear Receptors Enhance Our 
Understanding of Transcription Regulation," Trends in 
Genetics 4:309-314 (1988), both of which are hereby 
incorporated herein by reference. Where their functions 
have been defined, these receptors, complexed with their 
respective hormones, regulate the transcription of their 
primary target genes, as proposed for the ecdysone 
receptor in the above model. 

Cultivated agriculture has greatly increased 
efficiency of food production in the world. However, 
various insect pests exploit cultivated sources of food 
to their own advantage. These insect pests typically 
develop by a temporal sequence of events characteristic 
of their order. Many, including Drosophila , initially 
develop in a caterpillar or maggot-like larval form. 
Thereafter, they undergo a metamorphosis from which 
emerges an adult having characteristic anatomical 
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features. Anatomic similarity is a reflection of 
developmental , physiological and- biochemicail similarities 
shared by these insects* In particular, the principles 
governing the role of insect ecdysteroid-hormone 
receptors in Drosophila development, as described above, 
likely are shared by many different types of insects. 

As one weapon against the destruction of cultivated 
crops by insects, organic molecules with pesticidal 
properties are used commonly in attempts to eliminate 
insect populations. However, the ecological side effects 
of these pesticides, due in part to their broad activity 
and lack of specificity, and also in part to the fact 
that some of these pesticides are not easily 
biodegradable, significantly affect populations of both 
insects and other species of animals. Some of these 
populations may be advantageous from an ecological or 
other perspective. Furthermore, as the insect 
populations evolve to minimize the effects of the applied 
pesticides, greater amounts of pesticides must be 
applied, causing significant direct and indirect effects 
on other animals, including humans. Thus, an important 
need exists for both highly specific and highly active 
pesticides which are biodegradable. Novel insect 
hormones which, like the ecdysteroids, act by complexing 
with insect members of the steroid receptor superfamily 
to control insect development, are likely candidates for 
pesticides with these desirable properties. 

The use of insect hormones may also have other 
important applications. Many medically and commercially 
important proteins can be produced in a usable form by 
genetically engineered bacteria. However, many expressed 
proteins are processed incorrectly in bacteria and are 
preferably produced by genetically engineered eucaryotic 
cells. Typically, yeast cells or mammalian tissue- 
culture cells are used. Because it has been observed 
that protein processing of foreign proteins in yeast 
cells is also frequently inappropriate, mammalian 
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cultured cells have become the central focus for the 
production of many proteins. It is commonly known that 
the production of large amounts of foreign proteins makes 
these cells unhealthy, which may affect adversely the 
5 yield of the desired protein. This problem may be 

circumvented, in part, by using an inducible expression 
system^ In such a system, the cells are engineered so 
that they do not express the foreign protein until an 
inducing agent is added to the growth medium. In this 

10 way, large quantities of healthy cells can be produced 

and then induced to produce large amounts of the foreign 
protein. Unfortunately, in the presently available 
systems, the inducing agents themselves, such as metal 
ions or high temperature, adversely affect the cells, 

15 thus again lowering the yield of the desired foreign 

protein the cells produce. A need therefore exists for 
the development of benign inducing factors for efficient 
production of recombinant proteins. Such factors could 
also prove invaluable for the therapy of human patients 

20 suffering from inability to produce particular proteins, 

treatment with these factors controlling both the timing 
and the abundance of the protein produced in the affected 
individual . 

The hormones that complex with mammalian or other 
25 vertebrate members of the steroid receptor superf amily 

are unlikely candidates as such benign factors because 
they would alter the expression of many target genes in 
cells bearing these receptors, thereby adversely 
affecting the host cells. 
30 For these and other reasons, obtaining steroid 

receptors or nucleic acid information about them has been 
a goal of researchers for several years. Unfortunately, 
efforts have been unsuccessful despite a significant 
investment of resources. The absence of information on 
35 the structure and molecular biology of steroid receptors 

has significantly hindered the ability to produce such 
products . 
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Thus, there exists a need for detailed sequence 
information oil insect members of the steroid receptor 
superfamily, and the genes that encode these receptors 
and for resulting reagents. Reagents are provided which 
are useful in finding new molecules which may act as 
agonists or antagonists of natural insect members of the 
steroid receptor superfamily, or as components of systems 
for highly specific regulation of recombinant proteins in 
mammalian cells. 



SUMMARY OF THE INVENTION 
In accordance with the present invention, isolated 
recombinant nucleic acids are provided which, upon 
expression, are capable of coding for other than a native 

15 vertebrate steroid receptor or fragment thereof. 

These nucleic acids typically comprise a segment having a 
sequence substantially homologous to one or more coding 
regions of domains A, B, D, E, or F from an insect 
steroid receptor superfamily member gene having steroid 

20 binding domain homology, e.g., EcR, DHR3, E75A, or E75B. 

Preferably, the jiucleic acids encode a polypeptide 
capable of binding to a ligand for an insect steroid 
receptor superfamily member and are capable of 
hybridizing to an insect steroid receptor superfamily 

25 member gene segment tinder selective hybridization 

conditions, usually stringent hybridization conditions. 
Mammalian cells transformed with the nucleic acids are 
also provided. 

In another embodiment, isolated recombinant nucleic 

30 acids are included that have sequence exhibiting identity 

over about 20 nucleotides of a coding segment of an 
insect steroid receptor superfamily member having steroid 
binding domain homology. The nucleic acids can be 
transformed into cells to express a polypeptide which 

35 binds to a control lement responsive to a ligand of an 

insect steroid receptor superfamily. 
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Alternatively, an isolated DNA molecule is provided 

comprising- a - DNA~sequence^cap^ 

steroid receptor superfamily member other than 20-OH 
ecdysone receptor, such as DHR3, E75A, or E75B. The DNA 
5 sequence will generally be present in an expression 

vector and promote transcription of an operably linked 
sequence (e.g., encoding a polypeptide) in response to 
binding by an insect steroid receptor superfamily member. 
Cells comprising the nucleic acids are provided, as are 

10 cells expressing the polypeptides. In certain 

embodiments, non-insect cells will be used, including 
mammalian cells. Also contemplated are recombinant 
nucleic acids comprising a controlling element responsive 
to a ligand, e.g., a ligand which binds to an insect 

15 steroid receptor superfamily member ligand responsive 

controlling element, a non-heat shock promoter sequence 
(e.g., an alcohol dehydrogenase promoter) and a sequence 
comprising a reporter gene. Usually the controlling 
element will operate to make transcription of the 

20 reporter gene responsive to the presence of the ligand. 

Additional embodiments of the present invention 
include polypeptides comprising an insect steroid 
receptor superfamily member or fragment thereof, wherein 
such polypeptide is substantially free of naturally- 

25 associated insect cell components and exhibits a 

biological activity characteristic of an insect steroid 
receptor superfamily member with a hormone binding 
domain. Preferably, the insect steroid receptor 
superfamily member or fragment thereof also comprises a 

30 DNA binding domain and the polypeptide is capable of 

binding to a hormone analogue selected from the group 
consisting of an insect hormone, an insect hormone 
agonist and an insect hormone antagonist. The 
polypeptide can comprise a zinc-finger domain and usually 

35 is capable of binding to a DNA controlling element 

responsive to an insect hormone. As desired, the 
polypeptide will be fused to a second polypeptide, 
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typically a heterologous polypeptide which comprises a 
second steroid receptor" superfamily member* Cells, often 
mammalian cells, comprising the protein are provided. 

Fragments of such polypeptides can have a sequence 
substantially homologous to consensus El, E2 or E3 region 
sequences. By way of example, a preferred fragment has a 
sequence comprising: 

a segment at least about 25% homologous to a 
consensus El region sequence; 

a segment at least about 30% homologous to a 
consensus E2 region sequence; and 

a segment at least about 30% homologous to a 
consensus E3 region sequence. 

The polypeptides of the present invention have a 
variety of utilities. For example, a method for 
selecting DNA sequences capable of being specifically 
bound by an insect steroid receptor superfamily member 
can comprise the steps of screening DNA sequences for 
binding to such polypeptides and selecting DNA sequences 
exhibiting such binding. Alternatively, methods for 
selecting ligands, e.g., ecdysteroid analogues, specific 
for binding to a hormone binding domain of an insect 
steroid receptor superfamily member can comprise the 
steps of screening compounds for binding to one or more 
superfamily members and selecting compounds exhibiting 
specific binding to the members. 
Also included are methods for modulating insect 
physiology or development (e.g., killing) comprising the 
steps of screening compounds for binding to an insect 
steroid receptor superfamily member, selecting compounds 
exhibiting said binding and administering the ligand to 
an insect. 

Additionally provided are methods for selecting 
ligahds specific for binding to a ligand binding domain 
of an insect steroid receptor superfamily member 
comprising combining: 
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(i) a fusion polypeptide which comprises a 

rigand^M^ functionally 
linked to a DNA binding domain of a 
second steroid receptor superfamily 
5 member? and 

(ii) a second nucleic acid sequence encoding a 

second polypeptide, wherein 
expression of the second nucleic acid 
sequence is responsive to binding by 
10 the DNA binding domain; 

screening compounds for an activity of inducing 
expression of the second polypeptide; and 

selecting those compounds which do so. 
This will often be performed in a cell, e.g., with cells 
15 transformed with DNA encoding a fusion protein. This 

method allows selction of analogues which are useful in 
modulating insect physiology or development. 

Also provided are methods for producing a 
polypeptide comprising the steps of: 
20 selecting a cell, typically a mammalian or plant 

cell which is substantially insensitive to 
exposure of an insect steroid receptor 
superfamily ligand; 
introducing into said cell: 
25 (i) a receptor for the ligand; and 

(ii) a nucleic acid sequence encoding the 

polypeptide, the nucleic acid 
sequence operably linked to a 
controlling element responsive 
30 to presence of the selected 

ligand, wherein a transformed 
cell is produced; and 
exposing the transformed cell to the ligand. 
Usually the cell will be a mammalian cell, and will 
35 sometimes be introduced into a whole organism, e.g., a 

plant or animal. 
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Kits for the determination of expression levels of 
the nucleic acids and proteins provided herein are made 
available. Typically, the kit will have at least one 
compartment which contains a reagent which specifically 
5 binds to the desired target molecule, e.g., ligand 

analogues, receptors, or nucleic acids. 



BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. pMTEcR, a Cu 2+ -inducible EcR expression plasmid. 

10 The Pot, EcR ORF and Act 5c poly A elements are defined in 

Experimental Example III, part A. The HYG r ORF confers 
hygromycin resistance and is under control of the 
promoter in the LTR of Drosophila transposable elements, 
copia. The SV40 intron/poly A element provides an intron 

15 for a possible splicing requirement, as well as a 

polyadenylation/cleavage sequence for the HYG r ORF mRNA. 
The pAT153 DNA derives from a bacterial plasmid. 
Figure 2. The ecdysone-inducible pEcRE/Adh/figal reporter 
plasmid. See the text of Experimental Example III, part 

20 B, for the construction of this plasmid and the 

definitions of all symbols (except the SV40 splice and 
poly A) which are defined in the figure legend. 
Figure 3. The constitutive EcR expression plasmid, 
pActEcR. The construction of this plasmid and the 

25 definition of the symbols are given in Experimental 

Example III, part B. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention provides novel isolated 
nucleic acid sequences encoding polypeptide products 
exhibiting the structure and/or activities of insect 
members of the steroid receptor superfamily. Having 
elucidated the structures of these insect steroid 
receptors from their genes, the separate ligand-binding 
domains and DNA-binding domains are used individually or 
in combination to screen for new ligands or DNA sequences 
which bind to these domains. Thus, for example, by 
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binding to promoter sequences incorporating a DNA binding 
site, these receptors will usually control expression of 
reporter genes for which sensitive assays exist. Or, the 
hormone-binding domains serve as reagents for screening 
5 for agonists or antagonists of steroid receptor 

superfamily members. Either new classes of molecules, or 
selected modifications of known ligands will be screened 
for receptor binding. New ligands obtained in this way 
find use as highly specific and highly active naturally 

10 occurring pesticides. Alternatively, structural 

information about interactions between the ligand and 
binding domains directs methods for mutagenizing or 
substituting particular residues in the binding domains, 
thereby providing for altered binding specificity. Thus, 

15 inter alia , the present invention provides for screening 

for new ligand molecules, for the design of new ligand- 
binding domain interactions, for producing novel chimeric 
steroid receptor superfamily members and for generating 
new combinations of ligands and binding domains. 

20 The present invention also provides for the 

isolation or identification of new steroid hormone- 
responsive elements and associated genes. By appropriate 
operable linkage of selected sequences to DNA controlling 
elements which are responsive to binding by the DNA- 

25 binding domains of steroid receptor superfamily members, 

new regulatory combinations result. The present 
invention further provides for the design of either a 
binding domain in a member of the insect steroid receptor 
superfamily that will recognize given DNA sequences, or 

30 conversely for the modification of DNA sequences which 

will bind to particular receptor DNA-binding domains. 
Both the DNA-binding domain of a superf amily-member 
polypeptide and its DNA recognition sequence can be 
coordinately modified to produce wholly new receptor-DNA 

3 5 interactions . 

In an alternative embodiment, a DNA-binding sequence 
recognized by a selected receptor will be operably linked 
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to a desired genetic sequence for inducible expression. 
Thus, upon administration "of a ligand specific for that 
selected receptor, the genetic sequence is appropriately 
regulated. Expression systems are constructed that are 
5 responsive to administration of insect steroid receptor 

superfamily-specific ligands. By identifying and 
isolating new members of the insect steroid receptor 
superfamily, new, useful regulatory reagents become 
available, both hormones and controlling elements. 

10 In another embodiment, highly regulatable expression 

of a gene is achieved by use of regulatory elements 
responsive to ligands specific to the superfamily 
members. If transformed cells are grown under conditions 
where expression is repressed or not induced, the cells 

15 will grow to higher densities and enjoy less stressful 

conditions. Upon reaching high density, the regulatory 
ligand molecule is added to cause high expression. 
Selected cells otherwise insensitive to the inducing 
ligand will not be affected by exposure to the ligand 

20 used to regulate expression. This provides a means both 

for highly efficient regulatable expression of genes, and 
for introduction of these genes into intact organisms. 

In accordance with specific embodiments of the 
present invention, nucleic acid sequences encoding 

25 portions of insect steroid hormone receptor superfamily 

members have been elucidated. DNA encoding four 
different members of the Drosophila steroid receptor 
superfamily have been characterized: (1) the 20-OH 
ecdysone receptor, also called the ecdysone receptor 

30 (EcR) , for which a full-length encoding sequence has been 

determined; (2) Drosophila hormone receptor 3 (DHR3) , a 
protein with sequence homology to various steroid 
receptor superfamily members; (3 and 4) E75A and E75B, 
closely related proteins, encoded by segments of the same 

35 gene, and each possessing sequence homology to other 

steroid receptor superfamily members. 
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The DNA sequences encoding each of these members of 
the insect steroid" receptor superf amily provide~probes 
for screening for homologous nucleic acid sequences, both 
in Drosophila and other sources. This screening allows 
5 isolation of homologous genes from both vertebrates and 

invertebrates. Production of large amounts of the 
encoded proteins is effected by inserting those sequences 
into expression systems. 

The EcR, DHR3, E75A, and E75B genes are each linked 

10 to similar DNA sequences which likely function as 

controlling , or regulatory elements which are responsive 
to insect steroids. The present invention provides for 
the isolation of these hormone-responsive control 
elements, and for their use in regulating gene 

15 expression. One embodiment of a DNA construct comprises: 

(1) multiple copies of an insect steroid receptor 
superf amily controlling element linked to (2) a minimal 
gene promoter, preferably not a heat shock gene promoter, 
which provides highly inducible expression of (3) an 

20 operably linked gene. This construct provides a very 

sensitive assay for the presence of the controlling 
molecule of the receptor. 

Another aspect of the present invention involves 
cells comprising: (1) isolated recombinant gene segments 

25 encoding biologically active fragments of insect steroid 

receptor superf amily proteins; (2) DNA sequences which 
bind insect steroid receptors, e.g., the elements 
involved in hormone-responsive control; or (3) modified 
receptor proteins. Transformed cells are understood to 

30 include their progeny. In particular, the present 

invention provides for a system whereby expression of 
polypeptides is responsive to steroid induction • For 
instance, a system which expresses a desired protein in 
response to exposure to ecdysone analogues is constructed 

35 by operably linking a promoter having an ecdysone- 

responsive enhancer to a peptide encoding segment. 
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The present invention also provides insect steroid 
receptor proteins substantially free from naturally- 
associated insect cell components. Such receptors will 
typically be either full-length proteins , functional 
fragments, or fusion proteins comprising segments from an 
insect steroid receptor protein fused to a heterologous, 
or normally non-contiguous, protein domain. 

The present invention further provides a number of 
methods for utilizing the subject receptor proteins. One 
aspect of the present invention is a method for selecting 
new hormone analogues. The isolated hormone-binding 
domains specifically bind hormone ligands, thereby 
providing a means to screen for new molecules possessing 
the property of binding with high affinity to the ligand- 
binding region. Thus, a binding domain of an insect 
steroid receptor superfamily member will be used as a 
reagent to develop a binding assay. On one level, the 
binding domains are useful as affinity reagents for a 
batch or a column selective process, i.e., to selectively 
retain ligands which bind. Alternatively, a functional 
assay is preferred for its greater sensitivity to ligand- 
binding, whether a direct binding assay or an indirect 
assay in which binding is linked to an easily assayed 
function. For example, by operable linkage of an easily 
assayable reporter gene to a controlling element 
responsive to binding by an insect steroid receptor 
superfamily member, in which ligand-binding induces 
protein synthesis, an extremely sensitive assay for the 
presence of a ligand or of a receptor results. Such a 
construct useful for assaying the presence of 20-OH 
ecdysone is described below. This construct is useful 
for screening for agonists or antagonists of the 20-OH 
ecdysone ligand. 

In particular, this method allows selecting for 
ligands which bind to an "orphan" receptor, i.e., a 
receptor whose ligand is unknown. Binding domains for 
"unknown" ligands will often originate from either newly 
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identified insect steroid receptor superfamily members, 
or from mutagenesis. A hybrid recept r will be created 
with a ligand-binding domain and DNA-binding domain from 
different sources. For example, a hybrid receptor 
5 between a putative binding domain and a known DNA-binding 

domain would allow screening for ligands. An "orphan 
receptor" binding domain will be functionally linked to a 
known DNA-binding domain which will control a known 
reporter gene construct whose expression will be easily 
10 detected. This system for ligand-receptor binding 

provides an extremely sensitive assay for ligand-receptor 
interactions. 

Alternatively, the recognition of important features 
of tertiary structure and spatial interactions between a 

15 ligand-binding domain from an insect steroid receptor 

superfamily member and its ligand will allow selection of 
new combinations of ligand-binding domains with ligands. 
Either method provides for selecting unusual ligands 
which specifically bind a modified polypeptide-binding 

20 domain of a receptor. This approach allows selection of 

novel steroid hormone analogues which exhibit modified 
specificity for binding to a subgroup of steroid 
receptors . 

The present invention also provides for new and 
25 useful combinations of the various related components: 

the recombinant nucleic acid sequences encoding the 
polypeptides, the polypeptide sequences, and the DNA 
sites to which the receptors bind (i.e., the regulatory, 
or control, elements). For instance, fusing portions of 
30 nucleic acid sequences encoding peptides from different 

sources will provide polypeptides exhibiting hybrid 
properties, e.g., unusual control and expression 
characteristics. In particular, hybrid receptors 
comprising segments from other members of the 
35 superfamily, or from other sources, will be made. 

Combining an insect steroid receptor-responsive enhancer 
segment with a different polypeptide encoding segment 
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will produce a steroid-responsive expression system for 
that polypeptide. 

The isolation of insect steroid receptors provides 
for isolation or screening of new ligands for receptor 
binding. Some of these will interfere with, or disrupt, 
normal insect development. These reagents will allow the 
user to accelerate or decelerate insect development, for 
instance, in preparing sterile adults for release. 
Alternatively, a delay or change in the timing of 
development will often be lethal or will dramatically 
modify the ability of an insect to affect an agricultural 
crop. Thus, naturally occurring biodegradable and highly 
active molecules able to disrupt the timing of insect 
development will result. 

Furthermore, these polypeptides provide the means by 
which have been raised antibodies possessing specificity 
for binding to particular steroid receptor classes. 
Thus, reagents will be produced for determining, 
qualitatively or quantitatively the presence of these or 
homologous polypeptides. Alternatively, these antibodies 
will be used to separate or purify receptor polypeptides. 

Transcription se quences of insect steroid receptor 
superfamilv members 

The ecdysone receptor gene is a member of the 
steroid and thyroid hormone receptor gene super family, a 
group of ligand-responsive transcription factors. See . 
Evans (1988) Science 240:889-895; and Segraves, Molecular 
and Genetic Analysis of the E75 Ecdvsone-Respons ive Gene 
of Dros ophila melanogaster (Ph.D. thesis, Stanford 
University 1988), both of which are hereby incorporated 
herein by reference for all purposes. These receptors 
show extensive sequence similarity, especially in their 
"zinc finger" DNA-binding domains, and also in a ligand 
(or hormone or steroid) binding domain. Modulation of 
gene expression apparently occurs in response to binding 
of a receptor to specific control, or regulatory, DNA 
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elements. The cloning of receptor cDNAs provides the 

first-opportunity-tostudy- the-molecular bases- of steroid 

action. The steroid receptor superfamily is a class of 
receptors which exhibit similar structural and functional 
5 features. While the term insect is used herein , it will 

be recognized that the same methods and molecules will be 
derived from other species of animals , in particular , 
those of the class Insecta . or, more broadly, members of 
the phylum Arthropoda which use ecdysteroids as hormones. 

10 Members of the insect steroid receptor superfamily are 

characterized by functional ligand-binding and DNA 
binding domains, both of which interact to effect a 
change in the regulatory state of a gene operably linked 
to the DNA-binding site of the receptor. Thus, the 

15 receptors of the insect steroid receptor superfamily seem 

to be ligand-responsive transcription factors. The 
receptors of the present invention exhibit at least a 
hormone-binding domain characterized by sequence homology 
to particular regions, designated £1, £2 and £3. 

20 The members of the insect steroid receptor 

superfamily are typically characterized by structural 
homology of particular domains, as defined initially in 
the estrogen receptor. Specifically, a DNA-binding 
domain, C, and a ligand-binding domain, E, are separated 

25 and flanked by additional domains as identified by Krust 

et al. (1986) EMBO J. 5:891-897, which is hereby 
incorporated herein by reference. 

The C domain, or zinc-finger DNA-binding domain, is 
usually hydrophilic, having high cysteine, lysine and 

30 arginine content — a sequence suitable for the required 

tight DNA binding. The E domain is usually hydrophobic 
and further characterized as regions El, E2 and E3. The 
ligand-binding domains of the present invention are 
typically characterized by having significant homology in 

35 sequence and structure to these three regions. Amino 

proximal to the C domain is a region initially defined as 
separate A and B domains. Region D separates the more • 
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conserved domains C and E. Region D typically has a 
hydrophil-ic region whose predicted secondary structure "is 
rich in turns and coils. The F region is carboxy 
proximal to the E region (see, Krust et al., supra). 

The ligand-binding domain of the members of the 
insect steroid receptor superfamily is typically 
carboxyl-proximal , relative to a DNA-binding domain 
described below. See . Evans (1988) Science 240:889-895. 
The entire hormone-binding domain is typically between 
about 200 and 250 amino acids but is potentially shorter. 
This domain has the subregions of high homology, 
designated the El, E2 and E3 regions. See , e.g., 
Table 4. 

The El region is 19 amino acids long with a 
consensus sequence AKX(L/I)PGFXXLT(L/I) (D/E) DQITLL, where 
X represents any amino acid and the other letters are the 
standard single-letter code. Positions in parentheses 
are alternatives. Typically, members of the insect 
steroid receptor superfamily will have at least about 
five matches out of the sixteen assigned positions, 
preferably at least about nine matches, and in more 
preferred embodiments, at least about ten matches. 
Alternatively, these insect steroid receptor superfamily 
members will have homologous sequences exhibiting at 
least about 25% homology, normally at least about 30% 
homology, more normally at least about 35% homology, 
generally at least about 40% homology, more generally at 
least about 45% homology, typically at least about 50% 
homology, more typically at least about 55% homology, 
usually at least about 60% homology, more usually at 
least about 70% homology, preferably at least about 80% 
homology, and more preferably at least about 90% homology 
at positions assigned preferred amino acids. 

The E2 region is a 19 amino-acid segment with a 
consensus sequence: 

E(F/Y) (A/V) (L/C) (L/M)KA(I/L) (V/L)L(L/I) 
(N/S) (S/P)D(P/-) (R/K) (P/D)GL , 
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where - represents an optional absence of an amino acid. 
Typically an insect steroid receptor superfamily member 
will exhibit at least about six matches, preferably at 
least about eight matches and more preferably at least 
5 about nine matches. Alternatively, E2 sequences of 

insect steroid receptor superfamily members exhibit at 
least about about 25% homology, normally at least about 
30% homology, more normally at least about 35% homology, 
generally at least about 40% homology, more generally at 

10 least about 45% homology, typically at least about 50% 

homology, more typically at least about 55% homology, 
usually at least about 60% homology, more usually at 
least about 70% homology, preferably at least about 80% 
homology, and more preferably at least about 90% homology 

15 at positions assigned preferred amino acids. 

The E3 region is a 12 amino-acid segment with a 
consensus sequence 

LXKLLXXLPDLR . 
The insect steroid receptor superfamily members will 

20 typically show at least about four matches out of the 

nine assigned preferences in the £3 region, preferably at 
least about five matches and more preferably at least 
about six matches. Alternatively, over the assigned 
positions, members of the insect steroid receptor 

25 superfamily will exhibit at least about 25% homology, 

normally at least about 30% homology, more normally at 
least about 35% homology, generally at least about 40% 
homology, more generally at least about 45% homology, 
typically at least about 50% homology, more typically at 

30 least about 55% homology, usually at least about 60% 

homology, more usually at least about 70% homology, 
preferably at least about 80% homology, and more 
preferably at least about 90% homology at positions 
assigned preferred amino acids. 

35 In preferred embodiments, the insect steroid 

receptor superfamily members will exhibit matching of at 
least about five positions in an El region, at least 
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about six positions in an E3 region and at least about 
four- positions in an E3 region. The Ely E2, and £3 
regions are defined, e.g., in Table 4. 

The DNA-binding domain of these insect steroid 
receptor super family members is characterized by a "zinc 
fingers" motif. See , Evans (1988) Science 240:889-895. 
The domain is typically amino proximal to the ligand, or 
hormone, binding site. Typically, the DNA-binding domain 
of the insect steroid receptor superfamily members is 
characterized by clustering of basic residues, a 
cysteine-rich composition and sequence homology. See , 
Evans, R. M. (1988) Science 240:889-89; and Experimental 
section below. Significant polypeptide sequence homology 
among superfamily members exists. The insect steroid 
receptor superfamily members will exhibit at least about 
30% homology in the 67 + 1 amino acid region of this 
domain, normally at least about 40% homology, usually at 
least about 45% homology, and preferably at least about 
55% homology. 

Steroids are derivatives of the saturated 
tetracyclic hydrocarbon perhydrocyclopentanophenanthrene. 
Among the molecules in the group "steroids" are: the bile 
acids, e.g., cholic acid, and deoxycholic acid; the 
adrenocortical steroids, e.g., corticosterone and 
aldosterone; the estrogens, e.g., estrone and 6- 
estradiol; the androgens, e.g., testosterone and 
progesterone; and the ecdysteroids. The terms steroid or 
steroid hormones are used interchangeably herein and are 
intended to include all steroid analogues. Typically, 
steroid analogues are molecules which have minor 
modifications of various peripheral chemical groups. 
See, Koolman (ed.) (1989), cited above, for details on 
ecdysteroids . 

Although ligands for the insect steroid receptor 
superfamily members have historically been characterized 
as steroids, the term "steroid" as in "insect steroid 
receptor superfamily" is not meant only literally. The 
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use of "steroid" has resulted from a historical 
designation of members of a group recognized initially to 
include only molecules having specific defined molecular 
structures. However, this limitation is no longer 
5 applicable since functions are no longer only correlated 

with precise structures. Thus, there will be members of 
the insect steroid receptor super family, as defined 
herein, whose ligand-binding specificities are not 
directed to classically defined "steroids." Typically, 

10 the ligands for members of the insect steroid receptor 

superfamily are lipophilic molecules which are structural 
analogues of steroid molecules. 

The term ligand is meant to refer to the molecules 
that bind the domain described here as the "hormone- 

15 binding domain." Also, a ligand for an insect steroid 

receptor superfamily member is a ligand which serves 
either as the natural ligand to which the member binds, 
or a functional analogue which serves as an agonist or 
antagonist. The classical definition of "hormone" has 

20 been defined functionally by physiologists, see, e.g., 

Guyton, Textbook of Medical Physiology . Saunders, 
Philadelphia. The functional term "hormone" is employed 
because of historic usage, but is meant to apply to other 
chemical messengers used to communicate between cell 

25 types. Recently the distinction between hormones and 

neurotransmitters has been eroded as various peptide 
neurotransmitters have been shown to exhibit properties 
of classically defined hormones. These molecules are 
typically used in intercellular signal transduction, but 

30 are not limited to those molecules having slow or 

systemic effects, or which act at remote sites. 

Substantial homology in the nucleic acid context 
means either that the segments, or their complementary 
strands, when compared, are identical when optimally 

35 aligned, with appropriate nucleotide insertions or 

deletions, in at least about 40% of the residues, 
generally at least about 45%, more generally at least 
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about 50% , normally at least about 55%, more normally at 
least about 60%, typically at least about 65%, more 
typically at least about 70%, usually at least about 75%, 
more usually at least about 80%, preferably at least 
5 about 85%, and more preferably at least about 95% of the 

nucleotides. Alternatively, substantial homology exists 
when the segments will hybridize under selective 
hybridization conditions, to a strand or its complement,, 
typically using a sequence derived from Table 1, 2 or 3. 

10 Selectivity of hybridization exists when hybridization 

occurs which is more selective than total lack of 
specificity. Normally, selective hybridization will 
occur when there is at least about 55% homology over a 
stretch of at least about 14 to 25 nucleotides, generally 

15 at least about 65%, typically at least about 75%, usually 

at least about 85%, preferably at least about 90%, and 
more preferably at least about 95% or more. See , 
Kanehisa, M. (1984), Nucleic Acids Res. 12:203-213, which 
is incorporated herein by reference. Stringent 

20 hybridization conditions will include salt concentrations 

of less than about 2.5 M, generally less than about 1.5 
M, typically less than about 1 M, usually less than about 
500 mM, and preferably less than about 200 mM. 
Temperature conditions will normally be greater than 

25 20°C, more normally greater than about 25°C, generally 

greater than about 30 °C, more generally greater than 
about 35 °C, typically greater than about 40 °C, more 
typically greater than about 45 °C, usually greater than 
about 50 ^C, more usually greater than about 55 *C, and in 

30 particular embodiments will be greater than 60 # C, even as 

high as 80 *C or more. As other factors may significantly 
affect the stringency of hybridization, including, among 
others, base composition and size of the complementary 
strands, presence of organic solvents and extent of base 

35 mismatching, the combination of parameters is more 

important than the absolute measure of any one. 



WO 91/13167 



PCT/US91/01189 



23 

A gene for an insect steroid receptor superf amily 
- member gene includes its upstream ~(e .g . , promoter) and 
downstream operably linked controlling elements, as well 
as the complementary strands. See, generally, Watson et 
al. (1987) The Molecular Biology of the Gene, Benjamin, 
Menlo Park, which is hereby incorporated herein by 
reference. A gene geneally also comprises the segment 
encoding the transcription unit, including both introns 
and exons. Thus, an isolated gene allows for screening 
for new steroid receptor genes by probing for genetic 
sequences which hybridize to either controlling or 
transcribed segments of a receptor gene of the present 
invention. Three segments of particular interest are the 
controlling elements, both upstream and downstream, and 
segments encoding the DNA-binding segments and the 
hormone-binding segments. Methods applicable to such 
screening are analogous to those generally used in 
hybridization or affinity labeling. 

Nucleic acid probes will often be labeled using 
radioactive or non-radioactive labels, many of which are 
listed in the section on polypeptide labeling. Standard 
procedures for nucleic acid labeling are described, e.g., 
in Sambrook et al. (1989); and Ausubel et al. (1987 and 
supplements) • 

Insect steroid receptor superfamilv member polypeptides 
A polypeptide sequence of the ecdysone receptor is 
represented in Table 2. Other insect steroid receptor 
superfamily member polypeptide sequences are set forth in 
Tables 1 and 3. Preferred nucleic acid sequences of the 
cDNAs encoding these insect steroid receptor super family 
member polypeptides are also provided in the 
corresponding tables. Other nucleic acids will be used 
to encode the proteins, making use of the degeneracy or 
non-universality of the genetic code. 

As used herein, the term "substantially pure" 
describes a protein or other mat rial, e.g., nucleic 
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acid, which has been separated from its native 
contaminants. Typically, a monomer ic protein is 
substantially pure when at least about 60 to 75% of a 
sample exhibits a single polypeptide backbone. Minor 
variants or chemical modifications typically share the 
same polypeptide sequence. Usually a substantially pure 
protein will comprise over about 85 to 90% of a protein 
sample, and preferably will be over about 99% pure. 
Normally, purity is measured on a polyacryl amide gel, 
with homogeneity determined by staining. Alternatively, 
for certain purposes high resolution will be necessary 
and HPLC or a similar means for purification will be 
used. For most purposes, a simple chromatography column 
or polyacrylamide gel will be used to determine purity. 

The term "substantially free of naturally-associated 
insect cell components" describes a protein or other 
material, e.g., nucleic acid, which is separated from the 
native contaminants which accompany it in its natural 
insect cell state. Thus, a protein which is chemically 
synthesized or synthesized in a cellular system different 
from the insect cell from which it naturally originates 
will be free from its naturally-associated insect cell 
components. The term is used to describe insect steroid 
receptor superf amily members and nucleic acids which have 
been synthesized in mammalian cells or plant cells, E. 
coli and other procaryotes. 

The present invention also provides for analogues of 
the insect steroid receptor superf amily members. Such 
analogues include both modifications to a polypeptide 
backbone, e.g., insertions and deletions, genetic 
variants, and mutants of the polypeptides. Modifications 
include chemical derivatizations of polypeptides, such as 
acetylations, carboxylations and the like. They also 
include glycosylation modifications and processing 
variants of a typical polypeptide. These processing 
steps specifically include enzymatic modifications, such 
as ubiquitinylation. See , e.g., Hershko and Ciechanover 
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(1982), "Mechanisms of Intracellular Protein Breakdown," 
Ann, Rev, Bioch. . 51: 335-364. 

Other analogues include genetic variants, both 
natural and induced. Induced mutants are derived from 
5 various techniques, e.g., random mutagenesis using 

reagents such as irradiation or exposure to EMS, or 
engineered changes using site-specific mutagenesis 
techniques or other techniques of modern molecular 
biology. See, e.g., Sambrook et al. (1989) Molepul^y 

10 Cloninat A Laboratory Manual (2d ed.), CSH Press; and 

Ausubel et al. (1987 and supplements) Current Protocols 
in Molecular Biology , Greene/Wiley, New York, each of 
which is hereby incorporated herein by reference. 

As described above, the DNA-binding zinc fingers 

15 segment of a receptor shows high specificity of 

recognition of specific target DNA sequences. An 
understanding of the DNA-protein binding interactions 
provides for the modification in a rational manner of 
either DNA or protein characteristics, or both, to effect 

20 specificity of binding for modulation of enhancer 

activity. More importantly, isolation of genes for new 
members of the insect steroid receptor superfamily allows 
their use to produce the receptor polypeptides and to 
isolate new controlling elements. By using the DNA- 

25 binding domains, as described above, controlling elements 

which are responsive to the ligands bound by the 
corresponding superfamily members are identified and 
isolated. This procedure shall yield a variety of 
controlling elements responsive to ligands. By the 

30 methods described above, the ligands for any particular 

member of the insect steroid receptor superfamily will be 
identified. 

The controlling elements typically are enhancers, 
but also include silencers or various other types of 
35 ligand-r sponsive elements. They usually operate over 

large distances, but will typically be within about 
50 kb, usually within about 35 kb, more usually within 
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about 20 kb and preferably within about 7 kb of the genes 
that these elements regulate. 

Polypeptide fr agments and fusions 

Besides substantially full-length polypeptides, the 
present invention provides for biologically active 
fragments of the polypeptides. Significant biological 
activities include ligand-binding, DNA binding, 
immunological activity and other biological activities 
characteristic of steroid receptor superfamily members. 
Immunological activities include both immunogenic 
function in a target immune system, as well as sharing of 
immunological epitopes for binding, serving as either a 
competitor or substitute antigen for a steroid receptor 
epitope. 

For example, ligand-binding or DNA-binding domains 
from different polypeptides will be exchanged to form 
different or new fusion polypeptides or fragments. Thus, 
new chimaeric polypeptides exhibiting new combinations of 
specificities result from the functional linkage of 
ligand-binding specificities to DNA-binding domains. 
This is extremely useful in the design of inducible 
expression systems. 

For immunological purposes, immunogens will 
sometimes be produced from tandemly repeated polypeptide 
segments, thereby producing highly antigenic proteins. 
Alternatively, such polypeptides will serve as highly 
efficient competitors for specific binding. Production 
of antibodies to insect steroid receptor superfamily 
members is described below. 

The present invention also provides for other 
polypeptides comprising fragments of steroid receptor 
superfamily members. Fusion polypeptides between the 
steroid receptor segments and other homologous or 
heterologous proteins are provided, e.g., polypeptide 
comprising contiguous peptide sequences from different 
proteins. Homologous polypeptides will often be fusions 
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between different steroid receptor superfamily members, 
resulting in, for instance, a hybrid protein exhibiting 
ligand specificity of one member and DNA-binding 
specificity of another. Likewise, heterologous fusions, 
5 derived from different polypeptides, will be constructed 

which would exhibit a combination of properties or 
activities of the parental proteins. Typical examples 
are fusions of a reporter polypeptide, e.g., luciferase, 
with another domain of a receptor, e.g., a DNA-binding 

10 domain, so that the presence or location of a desired 

ligand is easily determined. See , e.g., Dull et al., 
U.S. No . 4 , 859 , 609 , which is hereby incorporated herein 
by reference. Other typical gene fusion partners include 
"zinc finger" segment swapping between DNA-binding 

15 proteins, bacterial B-galactosidase, trpE Protein A, 

B-lactamase, alpha amylase, alcohol dehydrogenase, and 
yeast alpha mating factor. See , e.g., Godowski et al. 
(1988), Science 241:812-816; and Experimental section 
below. 

20 

Insect steroid receptor superfamily member expression 

With the sequence of the receptor polypeptides and 
the recombinant DNA sequences encoding them, large 
quantities of members of the insect steroid receptor 

25 superfamily will be prepared. By the appropriate 

expression of vectors in cells, high efficiency protein 
production will be achieved. Thereafter, standard 
protein purification methods are available, such as 
ammonium sulfate precipitation, column chromatography, 

30 electrophoresis, centr if ugation , crystallization, and 

others. See, e.g., Deutscher (1990) "Guide to Protein 
Purification" Methods in EnzymolocrY . vol 182 and others; 
and Ausubel et al. (1987 and supplements) Current 
Protocols in Molecular Biology , for techniques typically 

35 used for protein purification. Alternatively, in some 

embodiments high efficiency of production is unnecessary, 
but the presence of a known inducing protein within a 
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carefully engineered expression system is quite valuable. 
For- instance, a combination of: (1) a ligahd-respbnsive 
enhancer of this type operably linked to (2) a desired 
gene sequence with (3) the corresponding insect steroid 
5 receptor superfamily member will be placed together in an 

expression system provides a specifically inducible 
expression system. The desired gene sequence will encode 
a protein of interest, and the corresponding steroid 
receptor member will often be the ecdysone receptor. 

10 Typically, the expression system will be a cell, but in 

yj-trp expression systems will also be constructed. 

The desired genes will be inserted into any of a 
wide selection of expression vectors. The selection of 
an appropriate vector and cell line depends upon the 

15 constraints of the desired product. Typical expression 

vectors are described in Sambrook et al. (1989) and 
Ausubel et al. (1987 and supplements). Suitable cell 
lines are available from a depository, such as the ATCC. 
See, ATCC Catalogue of Cell Lines and Hybridomas (6th 

20 ed.) (1988); ATCC Cell Lines, Viruses, and Antisera, each 

of which is hereby incorporated herein by reference. The 
vectors are introduced to the desired cells by standard 
transformation or transfection procedures as described, 
for instance, in Sambrook et al. (1989). 

25 Fusion proteins will typically be made by either 

recombinant nucleic acid methods or by synthetic 
polypeptide methods. Techniques for nucleic acid 
manipulation are described generally, for example, in 
Sambrook et al. (1989), Molecular Cloning; A Laboratory 

30 Manual (2d ed.), Vols. 1-3, Cold Spring Harbor 

Laboratory, which are incorporated herein by reference. 
Techniques for synthesis of polypeptides are described, 
for example, in Merrifield, J. Amer. Chem. Soc. 85:2149- 
2156 (1963). 

35 The recombinant nucleic acid sequences used to 

produce fusion proteins of the present invention will 
typically be derived from natural or synthetic sequences. 
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Many natural gene sequences are obtainable from various 
cDNA or from genomic 1 ibraries us Ting appropriate probes . 
See . GenBank™, National Institutes of Health. Typical 
probes for steroid receptors are selected from the 
sequences of Tables 1, 2 or 3 in accordance with standard 
procedures. The phosphoramidite method described by 
Beaucage and Carruthers, Tetra. Letts. 22:1859-1862 
(1981) will produce suitable synthetic ONA fragments. A 
double stranded fragment is then obtainable either by 
synthesizing the complementary strand and annealing the 
strand together under appropriate conditions or by adding 
the complementary strand using DNA polymerase with an 
appropriate primer sequence. 

With the isolated steroid receptor genes, segments 
of the transcribed segments are available as probes for 
isolating homologous sequences usually from different 
sources, e.g., different animals. By selection of the 
segment used as a probe, particular functionally 
associated segments will be isolated. Thus, for example, 
other nucleic acid segments encoding either ligand- 
binding or DNA-binding domains of new receptors will be 
isolated. Alternatively, by using steroid-responsive 
controlling elements as a probe, new steroid-responsive 
elements will be isolated, along with the associated 
segment of DNA whose expression is regulated. This method 
allows for the isolation of ligand-responsive genes, many 
of which are, themselves, also members of the insect 
steroid receptor superfamily. 

The natural or synthetic DNA fragments coding for a 
desired steroid receptor fragment will be incorporated 
into DNA constructs capable of introduction to and 
expression in an in vitro cell culture. Usually the DNA 
constructs will be suitable for replication in a 
unicellular host, such as yeast or bacteria, but 
alternatively are intended for introduction to, with or 
without integration into the genome, cultured mammalian 
or plant or other eucaryotic cell lines. DNA constructs 
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prepaxed for introduction into bacteria or yeast will 
typically include a replication system recognized by the" 
host, the intended DNA fragment encoding the desired 
receptor polypeptide, transcription and translational 
initiation regulatory sequences operably linked to the 
polypeptide encoding segment and transcriptional and 
translational termination regulatory sequences operably 
linked to the polypeptide encoding segment. The 
transcriptional regulatory sequences will typically 
include a heterologous enhancer or promoter which is 
recognized by the host. The selection of an appropriate 
promoter will depend upon the host, but promoters such as 
the trp, lac and phage promoters, tRNA promoters and 
glycolytic enzyme promoters are known. See , Sambrook et 
al. (1989). Conveniently available expression vectors 
which include the replication system and transcriptional 
and translational regulatory sequences together with the 
insertion site for the steroid receptor DNA sequence will 
generally be employed. Examples of workable combinations 
of cell lines and expression vectors are described in 
Sambrook et al. (1989); see also , Metzger et al. (1988), 
Nature 334:31-36. 

Genetic constructs 

The DNA segments encoding the members of the insect 
steroid receptor superfamily will typically be utilized 
in a plasmid vector. In one embodiment an expression 
control DNA sequence is operably linked to the insect 
steroid receptor superfamily member coding sequences for 
expression of the insect steroid receptor superfamily 
member alone. In a second embodiment an insect steroid 
receptor superfamily member provides the capability to 
express another protein in response to the presence of an 
insect steroid receptor ligand. This latter embodiment 
is separately described below. The expression control 
sequences will commonly include eukaryotic enhancer or 
promoter systems in vectors capable of transforming or 
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transfecting eucaryotic host cells. Once the vector has 
been introduced into the appropriate host, the" host/ 
depending on the use, will be maintained under conditions 
suitable for high level expression of the nucleotide 
sequences . 

* 

Steroid-responsive expression of selected genes 

For steroid-responsive expression of other genes, 
the steroid receptor gene will typically be cotrans formed 
with a recombinant construct comprising a desired gene 
for expression operably linked to the steroid-responsive 
enhancer or promoter element. In this use, a single 
expression system will typically comprise a combination 
of (1) a controlling element responsive to a ligand of an 
insect steroid receptor superfamily member, (2) a desired 
gene for expression, operably linked to the controlling 
element, and (3) an insect steroid receptor superfamily 
member which can bind to the controlling element. 
Usually, this system will be employed within a cell, but 
an in vitro system is also possible. The insect steroid 
receptor superfamily member will typically be provided by 
expression of a nucleic acid encoding it, though it need 
not be expressed at high levels. Thus, in one preferred 
embodiment, the system will be achieved through 
cotransformation of a cell with both the regulatable 
construct and another segment encoding the insect steroid 
receptor superfamily member. Usually, the controlling 
element will be an enhancer element, but some elements 
work to repress expression. In this embodiment, the 
ligand for the insect steroid receptor superfamily member 
will be provided or withheld as appropriate for the 
desired expression properties. 

A particularly useful genetic construct comprises an 
alcohol dehydrogenase promoter operably linked to an 
easily assayable reporter gene, e.g., 6-galactosidase. 
In a preferred embodiment of this construct, a 
multiplicity of copies of the insect steroid receptor 
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superfamily member is used. For example, operable 
linkage of controlling elements responsive to insect 
steroid receptor superfamily members, e,g., EcR, DHR3 , 
E75A and E75B, to the alcohol dehydrogenase (ADH) 
promoter, or others as described above, and protein 
coding sequences for a particular reporter protein, as 
described above, leads to steroid-responsive expression 
of B-galactosidase. Such a system provides highly 
sensitive detection of expression in response to ligand 
binding, allowing for detection of a productive ligand- 
receptor interaction. 

DNA sequences will normally be expressed in hosts 
after the sequences have been operably linked to (i.e., 
positioned to ensure the functioning of) an expression 
control sequence. These expression vectors are typically 
replicable in the host organisms either as episomes or as 
an integral part of the host chromosomal DNA. Commonly, 
expression vectors will contain selection markers, e.g., 
tetracycline or neomycin, to permit detection of those 
cells transformed with the desired DNA sequences ( see , 
e.g., U.S. Patent 4,704,362, which is incorporated herein 
by reference) . 

E. coli is one procaryotic host useful for cloning 
the DNA sequences of the present invention. Other 
microbial hosts suitable for use include bacilli, such as 
Bacillus subtilis r and other enterobacteriaceae, such as 
Salmonella, Serratia, and various Pseudomonas species. 

Other eucaryotic cells will often be used, including 
yeast cells, insect tissue culture cells, avian cells, or 
the like. Preferably, mammalian tissue cell culture will 
be used to produce the inducible polypeptides of the 
present invention (see, Winnacker, From Genes to Clones , 
VCH Publishers, N.Y. (1987) , which is incorporated herein 
by reference) . Mammalian cells are preferred cells in 
which to use the insect steroid receptor superfamily 
member ligand-responsive gene constructs, because they 
naturally lack the molecules which confer responses to 
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the ligands for insect steroid receptor superfamily 
m mbers. - " 

Mammalian cells are preferred because they are 
insensitive to many ligands for insect steroid receptor 
5 superfamily member. Thus, exposure of these cells to the 

ligands of the insect steroid receptor superfamily 
members typically will have negligible physiological or 
other effects on the cells, or on a whole organism. 
Therefore, cells can grow and express the desired 

10 product, substantially unaffected by the presence of the 

ligand itself. The ligand will function to cause 
response either in the positive or negative direction. 
For example, it is often desirable to grow cells to high 
density before expression. In a positive induction 

15 system, the inducing ligand would be added upon reaching 

high cell density, but since the ligand itself is benign 
to the cells, the only physiological imbalances result 
from the expression, e.g., the product, itself. 
Alternatively, in a negative repression system, the 

20 ligand is supplied until the cells reach a high density. 

Upon reaching a high density, the ligand would be 
removed. Introduction of these cells into a whole 
organism, e.g., a plant or animal, will provide the 
products of expression to that organism. In this 

25 circumstance, the natural insensitivity of cells to the 

ligands will also be advantageous. 

Expression vectors for these cells can include 
expression control sequences, such as an origin of 
replication, a promoter, an enhancer and necessary 

30 processing information sites, such as ribosome-binding 

sites, RNA splice sites, polyadenylation sites, and 
transcriptional terminator sequences. Preferably, the 
enhancers or promoters will be those naturally associated 
with genes encoding the steroid receptors, although it 

35 will be understood that in many cases others will be 

equally or more appropriate. Other preferred expr ssion 
control sequences are enhancers or promoters derived from 
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viruses, such as SV40, Adenovirus, Bovine Papilloma 
Virus, and the like. 

Similarly, preferred promoters are those found 
naturally in immunoglobulin-producing cells ( see . U.S. 
5 Patent No. 4,663,281, which is incorporated herein by 

reference), but SV40, polyoma virus, cytomegalovirus 
(human or murine) and the LTR from various retroviruses 
(such as murine leukemia virus, murine or Rous sarcoma 
virus and HIV) are also available. See . Enhancers and 

10 Eukarvotic Gene E xpression . Cold Spring Harbor Press, 

N.Y., 1983, which is incorporated herein by reference. 

The vectors containing the DNA segments of interest 
(e.g., the steroid receptor gene, the recombinant 
steroid-responsive gene, or both) can be transferred into 

15 the host cell by well-known methods, which vary depending 

on the type of cellular host. For example, calcium 
chloride transfection is commonly utilized for 
procaryotic cells, whereas calcium phosphate treatment is 
often used for other cellular hosts. See, generally, 

20 Sambrook et al. (1989), Molecular Cloning: A Laboratory 

Ifenual (2d ed.), Cold Spring Harbor Press; Ausubel et al. 
(1987 and supplements) Cement Protocls in Molecular 
Biology, Greene/Wiley, New York; and Potrykus (1990) 
M Gene Transfer to Cereals: An Assessment," 

25 Bio/Technolocrv 8:535-542, each of which is incorporated 

herein by reference. Other transformation techniques 
include electroporation, DEAE-dextran, microprojectile 
bombardment, lipofection, microinjection, and others. 
The term "transformed cell" is meant to also include the 

30 progeny of a transformed cell. 

As with the purified polypeptides, the nucleic acid 
segments associated with the ligand-binding segment and 
the DNA-binding segment are particularly useful. These 
gene segments will be used as probes for screening for 

35 new genes exhibiting similar biological activities, 

though the controlling elements of these genes are of 
equal importance, as described below. 
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Many types of proteins are preferentially produced 

— in- ucaryot ic - ceHr types^because-of ~abnorTaal~processing 

or modification in other cell types. Thus, mammalian 
proteins are preferably expressed in mammalian cell 
5 cultures. Efficient expression of a desired protein is 

often achieved, as described above, by placing: (1) a 
desired protein encoding DNA sequence adjacent to 
controlling elements responsive to ligands for insect 
steroid receptor superf amily members and an appropriate 

10 promoter. Cyclic pulses of ligands in a cell culture may 

provide periods for cells to recover from effects of 
production of large amounts of exogenous protein. Upon 
recovery, the ligand will often be reinduced. 

Additional steroid responsive gene elements have 

15 also been isolated, e.g., substantially purified, using 

the techniques of the present invention. Other genes 
adjacent to, and operably linked to, steroid responsive 
gene control elements are selectable by locating DNA 
segments to which steroid receptors specifically bind or 

20 by hybridization to homologous controlling elements. For 

example, other steroid responsive genes have been 
isolated. Many of the genes which are ligand-responsive 
will also be new members of the insect steroid receptor 
super family. 

25 Having provided for the substantially pure 

polypeptides, biologically active fragments thereof and 
recombinant nucleic acids comprising genes for them, the 
present invention also provides cells comprising each of 
them. By appropriate introduction techniques well known 

30 in the field, cells comprising them will be produced. 

See , e.g., Sambrook et al. (1989). 

In particular, cells comprising the steroid 
responsive controlling elements are provided, and 
operable linkage of standard protein encoding segments to 

35 said controlling elements produce steroid responsive 

systems for gene expression. Cells so produced will 
often be part of, or be introduced into, intact 
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organisms, for example, plants, insects (including 
caterpillars and larvae) , and higher animals, e.g. r, 
mammals. This provides for regulatable expression of 
desired genes where the regulating ligand has no other 
effects on the cells because the cells otherwise lack the 
receptors and responsive genes. For example, plants will 
be induced to fruit at desired times by administration of 
the appropriate ligand, or animals will be ligand- 
responsive in production of particular products. And, in 
fact, biochemical deficiencies may be overcome by ligand- 
responsive expression of cells introduced into an intact 
organism which, itself, also otherwise lacks genes 
responsive to the presence of such a ligand. Multiple 
repeats of the control elements will lead, often, to at 
least additive or synergistic control. Cells containing 
these expression systems will be used in gene therapy 
procedures, including in humans. 

Once a sufficient quantity of the desired steroid 
receptor polypeptide has been obtained, the protein is 
useful for many purposes. A typical use is the 
production of antibodies specific for binding to steroid 
receptors. These antibodies, either polyclonal or 
monoclonal, will be produced by available in vitro or in 
vivo techniques. 

For production of polyclonal antibodies, an 
appropriate target immune system is selected, typically a 
mouse or rabbit. The substantially purified antigen is 
presented to the immune system in a fashion determined by 
methods appropriate for the animal and other parameters 
well known to immunologists . Typical sites for injection 
are in the footpads, intramuscularly, intraperitoneally, 
or intradermally. Of course, another species will often 
be substituted for a mouse or rabbit. 

An immunological response is usually assayed with an 
immunoassay. Normally such immunoassays involve some 
purification of a source of antigen, for example, 
produced by the same cells and in the same fashion as the 
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antigen was produced. The immunoassay will generally be 
a radioimmunoassay, an enzyme-linked assay (EKESA) , a 
fluorescent assay, or any of many other choices, most of 
which are functionally equivalent but each will exhibit 
advantages under specific conditions. 

Monoclonal antibodies with affinities of 10 8 M" 1 
preferably 10 9 to 10 10 , or stronger will typically be 
made by standard procedures as described, e.g., in Harlow 
and Lane (1988), Antibodies; A Laboratory Manual, Cold 
Spring Harbor Laboratory; or Goding (1986) , Monoclonal 
Antibodies: Principles and Practice (2d ed) Academic 
Press, New York, which are hereby incorporated herein by 
reference. Briefly, appropriate animals will be selected 
and the desired immunization protocol followed. After 
the appropriate period of time, the spleens of such 
animals are excised and individual spleen cells fused, 
typically, to immortalized myeloma cells under 
appropriate selection conditions. Thereafter the cells 
are clonally separated and the supematants of each clone 
are tested for their production of an appropriate 
antibody specific for the desired region of the antigen. 

Other suitable techniques involve in vitro exposure 
of lymphocytes to the antigenic polypeptides or 
alternatively to selection of libraries of antibodies in 
phage or similar vectors. See , Huse et al., (1989) 
"Generation of a Large Combinatorial Library of the 
Immunoglobulin Repertoire in Phage Lambda," Science 
246:1275-1281, hereby incorporated herein by reference. 

The polypeptides and antibodies of the present 
invention will be used with or without modification. 
Frequently, the polypeptides and antibodies will be 
labeled by joining, either covalently or non-covalently, 
a substance which provides for a detectable signal. A 
wide variety of labels and conjugation techniques are 
known and are reported extensively in both the scientific 
and patent literature. Suitable labels include 
radionuclides, enzymes, substrates, cof actors, 
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inhibitors, fluorescens, chemiluminescers, magnetic 
particles and- the- like. Patents, teaching the use of 
such labels include U.S. Patent Nos. 3,817,837; 
3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 
and 4,366,241. Also, recombinant, chimeric, or 
humanized, immunoglobulins will be produced, see, e.g., 
Cabilly, U.S. Patent No. 4,816,567; Jones et al. # 1986, 
Nature 321, 522-526; and published UK patent application 
No. 8707252; each of which is hereby incorporated herein 
by reference. 

Another use of purified receptor polypeptides is for 
determination of the structural and biosynthetic aspects 
of the polypeptides. Structural studies of interactions 
of the ligand-binding domains with selected ligands are 
performed by various methods. The preferred method for 
structural determination is X-ray crystallography but may 
include various other forms of spectroscopy or 
chromatography. See, e.g. , Connolly, M.L. , J. Appl. 
Crystall^, 16:548 (1983); Connolly, M.L., Science 221:709 
(1983); and Blundell and Johnson (1976) Protein 
Crystal 1 oar aphy . Academic Press, New York; each of which 
is hereby incorporated herein by reference. For example, 
the structure of the interaction between hormone ligand 
and hormone-binding segments is determined to high 
resolution. From this information, minor substitutions 
or modifications to either or both of the ligand and 
ligand-binding segment are made based upon, e.g., the 
contact regions between the two. This information 
enables the generation of modified interactions between a 
ligand and its binding segment to either increase or 
decrease affinity of binding and perhaps increase or 
decrease response to binding. Likewise, the interaction 
between the zinc finger DNA-binding segments with the 
specific nucleic acid-binding sequence will be similarly 
modified. 

As a separate and additional approach, isolat d 
ligand-binding polypeptide domains will be utilized to 
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screen for new ligands. Binding assays will be 
.... . developed, analogous, e.g., to immunoassays. This 

procedure permits screening for new agonists or 
antagonists of a particular steroid receptor. Isolated 
5 DNA-binding segments will be used to screen for new DNA 

sequences which will specifically bind to a particular 
receptor-binding segment. Typically, these receptor- 
specific binding sites will be controlling elements for 
steroid responsive genes. Thus, having isolated these 
10 DNA-binding sequences, genes which are responsive to the 

binding of a given receptor can be isolated. This 
provides a method for isolating genes which are 
responsive to induction or inhibition by a given hormone 
receptor. 

15 In another aspect of the present invention , means 

for disrupting insect development are provided where new 
ligand agonists or antagonists are discovered. These 
compounds are prime candidates as agonists or antagonists 
to interfere with normal insect development. By 

20 application of new steroid analogues of ligands for 

insect steroid receptor superfamily members, it is 
possible to modify the normal temporal sequence of 
developmental events. For example, accelerating insect 
development will minimize generation time. This will be 

25 very important in circumstances where large numbers of 

insects are desired finally, for instance, in producing 
sterile males in Mediterranean fly infestations. 
Alternatively, it is useful to slow development in some 
pest infestations, such that the insects reach 

30 destructive stages of development only after commercial 

crops have passed sensitive stages. 

In another commercial application, ligands 
discovered by methods provided by the present invention 
will be used in the silk-production industry. Here, the 

35 silkworms are artificially maintained in a silk-producing 

larvae stage, thereby being silk productive for extended 
time periods. The development of larvae will also be 
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susceptible to acceleration to silk-production in a 
shorter time peri d_ than- than naturally 

Other analogues of ligands for insect steroid 
receptor superfamily members will be selected which, upon 
application, will completely disrupt normal development 
and, preferably, lead to a lethal result. Slightly 
modified natural substances will often have greater 
specificity of action and much higher activities, 
allowing for lower levels of application. For example, 
more lipophilic ligands are more readily absorbed 
directly into the insect surface or cuticle. Thus, 
extremely low concentrations of natural ligands should be 
effective in controlling pests. Furthermore, many of 
these ligands are likely to be relatively easily 
manufactured, taking advantage of enzymatic production 
methods. New ligands for insect steroid receptor 
superfamily members will sometimes be more species 
specific or will exhibit particularly useful 
characteristics, for example, being lethal to specific 
harmful insects. The greater specificity of the hormones 
will allow avoidance of the use of non-specific 
pesticides possessing undesired deleterious ecological 
side effects, e.g., pesticide residue accumulation in 
food, often having deleterious effects on humans. 
Furthermore, compounds having structures closely 
analogous to natural compounds should be susceptible to 
natural mechanisms of biological degradation. 

Another aspect of the present invention provides for 
the isolation or design of new gene segments which are 
responsive to ligands for insect steroid receptor 
superfamily members. For example, use of the nucleic 
acids to screen for homologous sequences by standard 
techniques will provide genes having similar structural 
features. Similarly arranged intron structures will 
typically be characteristic of larger superfamily 
categories. The preferred domains for screening will be 
the ligand-binding or DNA-binding segments, although the 
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DNA segments which are recognized by the DNA-binding 
domains, i.e. , the controlling lements, will also be of 
particular interest. Screening for new controlling 
elements will usually take advantage of known 
5 similarities, e.g., sequence homology to other known 

elements, or homology to the DNA zinc finger-binding 
domains of other receptors. Receptors and genes 
important in the general developmental sequence of 
expression will be discovered. Using this set of 
10 development ally regulated genes will allow selection of 

particular molecules which are responsible for 
controlling expression of developmentally regulated 
genes. 

Kits for the determination of expression levels of 
15 the nucleic acids and proteins provided herein are made 

available. Typically, the kit will have at least one 
compartment which contains a reagent which specifically 
binds to the desired target molecule, e.g., ligand 
analogues, receptors, or nucleic acids. These reagents 
20 will be used in techniques for assays, e.g., using 

methods typically used in screening protocols. See, 
e.g., Sambrook et al. (1989) and Ausubel et al. (1987 and 
supplements) • 

The following experimental section is offered by way 
25 of example and not by limitation. 



EXPERIMENTAL 
EXAMPLE I 

CLONING STRUCTURE AND EXPRESSION OF THE DROSOPHILA 
30 E75 GENE THAT ENCODES TWO MEMBERS OF THE STEROID 

RECEPTOR SUPERFAMILY. 

The following experiments demonstrate that the E75 
gene encodes two members of the steroid receptor 
35 superfamily. The proteins it encodes share amino acid 

sequence homology with the conserved DNA-binding and 
ligand-binding domains of this superfamily. The E75 gene 
is ecdysone-inducible, and it occupies and causes the 
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ecdysone-inducible early puff at th 75B locus in the 
Drosophila polytene chromosomev 

A. Cloning of Genomic DNA Encompassing the 
Ecdvsone-Inducible 75B Puff Locus 

We have used the method of chromosomal walking 
(Bender, W. f P. Spierer, and D. S. Hogness, 1983. 
Chromosomal walking and jumping to isolate DNA from the 
Ace and rosy loci and the Bithorax complex in Drosophila 
melanoaaster- J, Mol. Biol. 168:17-33) to isolate the 
genomic DNA encompassing the 75B puff region. The 
starting point for the walk was a genomic clone, 
designated A8253, which had been localized by in situ 
hybridization to the proximal end of 75B. Isolated 
restriction fragments of A8253 were used to screen a 
library of genomic DNA from the Canton S (C 5 ) strain of 
fi. melanocraster . See (Maniatis, T., R. C. Hardison, E. 
Lacy, J. Lauer, C. ©•Connell, D. Quon, G. K. Sim, and 
A. Efstradiatis, 1978, "The isolation of structural genes 
from libraries of eucaryotic DNA." Cell 15; 687-701). 
Genomic clones AcDm3504 and AcDm3505 were isolated by 
homology to A8253. 

The walk was then extended in both directions until 
-100 kb of genomic DNA had been isolated, and the 
orientation of the walk was determined by in situ 
hybridization of the terminal segments to polytene 
chromosomes. Thereafter, the walk was extended in the 
rightward direction on the molecular map, or distally 
relative to the centromere. The 350 kb of genomic DNA 
encompassed by the walk corresponds to the chromosomal 
region between bands 75A6-7 and 75B11-13, as determined 
by in situ hybridization. This region includes the 75B 
puff, which appears to initiate by simultaneous 
decondensation of chromosomal bands 75B3-5 and then 
spreads to surrounding bands. 



Methods 

Genomic DNA libraries 
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Canton S genomic DNAs were isolated from a library 
- of sheared, EcoRI-link red Canton SDNA cloned into the 
Charon 4 A phage vector. See (Maniatis, T., 
R. C. Hardison, E. Lacy, J. Lauer, C. 0»Connell, D. Quon, 
5 G. K. Sim, and A. Efstradiatis, 1978. The isolation of 

structural genes from libraries of eucaryotic DNA. Cell 
15:687-701). O r genomic DNAs were isolated from a library 
of sheared DNA, GC-tailed into the sep6 A vector. See 
(Meyerowitz, F. M. , and D. S. Hogness, 1982. "Molecular 

10 organization of a Drosoohila puff site that responds to 

ecdysone." Cell 28:165-176). One step in the 
chromosomal walk was taken using a cosmid library of 
SauIIIA partially digested O r DNA cloned into the cosmid 
pl4Bl by the method of Ish-Horowicz and Burke (Ish- 

15 Horowicz, D. , and J. F. Burke, 1982. Rapid and efficient 

cosmid cloning. Nucleic Acids Res. 9:2989-2998). 

Jr\ sjtu hyfrridj^tAon 

In situ hybridization to polytene chromosomes was 

20 carried out with DNA probes that were nick-translated in 

the presence of ^-labeled TTP (NEN) , as described by 
Bonner and Pardue (Bonner, J. J., and M. L. Pardue, 1976. 
Ecdysone-stimulated RNA synthesis in imaginal discs of 
Drosophila melanoaaster . Assay by in situ hybridization. 

25 Chromosoma 58:87-99), with the following modifications: 

Heat and RNAase treatments of the slides were omitted, 
and hybridization and washing were at 63 °C in 2XSSPE for 
18 and 2 hours, respectively. 

30 B. Identification of a 50 kb Region of Cloned 

Genomic DNA that Contains Sequences Homologous 
to Ecdvsone- induced Transcripts 

Restriction fragments of the above genomic clones 
35 were tested for their ability to hybridize with each of 

two cDNA probes, one derived from the RNA in ecdysone- 
induced cells, and the other from the RNA in noninduced 
cells. Two differential screens were carried out. In 
the first, genomic DNA covering the entire 350 kb walk 
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was examined with cDNA probes synthesized with reverse 

transcriptase froman- oligo(dT) -primer annealed" to 

poly (A) + RNA. The poly (A) + RNA was prepared from total 
inner tissues that were mass-isolated from late third 
instar larvae and incubated in the presence of ecdysone 
plus cycloheximide, or cycloheximide alone. (See 
Methods, below. Cycloheximide was included because 
higher levels of ecdysone-induced transcripts accumulate 
in its presence.) 

Each of the 32 P-labeled cDNA probes made from these 
two poly (A) + RNAs was applied to one of two duplicate 
Southern blots that contained, in addition to the genomic 
fragments from the walk, a control DNA consisting of 
sequences from the ribosomal protein 49 gene (C^Connell, 
P., and M. Rosbash, 1984. Sequence, structure and codon 
preference of the Drosophil^ ribosomal protein A£ gene. 
Nupleic Acids Res. 12:5495-5513), which was used to 
normalize the hybridization intensities of the duplicate 
blots. This screen revealed sequences specific to 
ecdysone-induced RNAs only within the AcDm3522 genomic 
clone that is centered at approximately +220 kb on the 
molecular map. 

Because the above probes will preferentially detect 
sequences near the 3* termini of the RNAs, particularly 
in the case of long transcripts, a second differential 
screen was carried out with cDNA probes primed with 
random hexamers (see Methods, below) . This screen, which 
was restricted to the 135 kb of genomic DNA between +105 
kb and +240 kb, revealed ecdysone-inducible sequences in 
fragments spread out over an -50 kb region between +170 
kb and +220 kb. This region represents the E75 gene. 

Methods 

Organ culture and RNA isolation 

Late third instar 0 r larvae were harvested, washed in 
0.7% NaCl, resuspended in Robb's phosphate-buffered 
saline (PBS) (Robb, J. A., 1968. Maintenance of imaginal 
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discs of Drosophila melanocraster in chemically defined 
media. J. Cell, Biol. 41:876-885) , preaerated with a 
blender, and passed through a set of rollers to extrude 
the organs. This "grindate" was filtered through a 
5 coarse Nitex screen to remove carcasses, and settled five 

times (3-5 minutes per settling) by gravity to remove 
floating and microscopic debris. Isolated tissues 
(primarily salivary glands, imaginal discs, gut, and 
Malphigian tubules) were cultured at 25 # C in plastic 

10 petri dishes in aerated Robb's PBS. B-ecdysone (Sigma) 

(0.2 fil/ml of 10 mg/ml) in ethanol and/or cycloheximide 
(2 Ml/ml of 35 mM) in water was added to the appropriate 
cultures. Incubations in the presence of cycloheximide 
were for -8 hours. Isolated tissues were homogenized in 

15 10 volumes of 6 M guanidine-HCl/O. 6 M sodium acetate 

(pH 5.2), centrifuged at 5000 g for 10 minutes to remove 
debris, and layered onto a 5.7 M CaCl shelf, as described 
previously (Chirgwin, J. M. , A. E. Przbyla, 
R. J. MacDonald, and W. J. Rutter, 1979. Isolation of 

20 biologically active ribonucleic acid from sources 

enriched in ribonuclease. Biochemistry 18:5294-5299). 
Poly (A) + RNA was purified by oligo(dT) chromatography. 

Southern blot analysis 
25 Southern blots were performed on nitrocellulose, as 

described previously (Segraves, W. A., C. Louis, S. 

Tsubota, P. Schedl, J. M. Rawls, and B. P. Jarry, 1984. 

The rudimentary locus of Drosophila melanoaaster. 

Mol. Biol. 175:1-17). cDNA probes were prepared by 
30 reverse transcription (AMV reverse transcriptase; 

Seikagaku) of 2 fig of poly (A) + RNA with 700 ng of 

oligo(dT) 12 ' 16 (Collaborative Research) or 15 fig of random 

hexamers (Pharmacia) in a 20 /il reaction mixture 

containing 80 mM Tris CI (pH 8.3 at 42 *C), 10 mM MgCl 2 , 
35 100 mM KC1, 0.4 mM DTT, 0.25 mM each of dATP, dGTP, and 

dTTP, and 100 fid of [ 32 P]dCTP (800 Ci/mole; Amersham) . 

After incubation at 37 °C for 45 minutes, 80 /il of 10 mM • 
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EDTA and 2 /il of 5 N NaOH were added before incubation at 
7.0. re for 10 minutes to denature the -products and 
hydrolyze the RNA. After the addition of 10 /il of 1 M 
Tris-Cl (pH 7.5) and 5 /il of 1 N HC1, unincorporated 
5 label was removed by chromatography on Biogel P60. 

C. The E75 Gene Contains Two Overlapping 
Transcription Units: E75A and E75B 

10 Northern blot analysis of ecdysone-induced and 

noninduced RNAs, prepared as described above and 
hybridized with strand-specific DNA probes derived from 
cloned restriction fragments in the 60 kb region (+166 to 
+226 kb) containing the E75 gene, demonstrated that this 

15 gene produces two classes of ecdysone-inducible mRNAs, 

both derived from rightward transcription. The E75A 
class of mRNAs hybridized with probes from both the 5' 
(left) and 3 • (right) ends of the 50 kb 125 gene. The 
E75B class hybridized only with probes from the 3 1 - 

20 proximal 20 kb of the gene. These results suggest that 

the A and B classes of ecdysone-inducible RNAs are 
initiated by different promoters, located about 30 kb 
apart, and that the two transcription units defined by 
these promoters overlap in the region downstream from the 

25 b promoter. 

This suggestion was confirmed by analysis of the 
structure of cloned cDNAs from the E75A and E75B mRNAs. 
Approximately 10 6 clones from an early pupal cDNA library 
(Poole, S. J. f L. M. Kauvar, B. Drees, and T. Kornberg, 

30 1985. The engrailed locus of Drosophila : Structural 

analysis of an embryonic transcript. Cell 40:37-40) were 
screened at low resolution with genomic DNA probes from 
the E75 gene region. The 116 cDNA clones identified by 
this screen were analyzed by restriction digestion and 

35 hybridization to a panel of probes derived from the 60 kb 

(+166 to +226 kb) region. One of the clones, ADm4925, 
was thereby selected as a representative of the E75A 
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ciass of mRNAs , and another, ADm4745, as a representative 

of- the -E7 5B -mRNA-class . — 

The genomic regions homologous to these two cDNA 
clones were further localized by Southern blot analysis, 
and the nucleotide sequence of these regions and of both 
cDNA clones was determined. These sequences are given in 
Table 1, along with those derived from 5 1 and 3' terminal 
sequence determinations for each transcription unit. 
These data demonstrate that the 50 kb E75A transcription 
unit consists of six exons, labeled in 5 1 to 3 1 order: 
AO, Al, 2, 3, 4 and 5, of which exons AO and Al are 
specific to this unit, while the remaining four are 
shared with the 20 kb E75B transcription unit. 
Similarly, the E75B unit contains a specific exon, 
labeled Bl, at its 5 1 end, which is located just upstream 
of the shared exon 2. Thus, the E75 gene consists of two 
transcription units, of which the shorter E75B unit 
occupies the 3* proximal 20 kb of the longer E75A unit. 
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Table 1, Sequences _of the_E75 jexqns_and__ 

flanking DNA. The sequence is that of the C s genomic 
DNA, which was identical to that of the cDNAs, except for 
5 . the T-+G change indicated at position +2691. This change 

would convert a leucine to an arginine in the protein 
sequences. The Dm4925 cDNA extends from just 5' of the 
EcoRV site at +939 to +4267 in A. The Dm4745 cDNA 
extends from +804 in B to a point near the Hindlll site 

10 at +4246 in A. (A) The E75 A exons and flanking DNA. The 

sequences of the AO, Al, and common exons 2-5 are 
interrupted by intron sequences (lowercase) , which are 
limited to those near the splice sites and are in 
agreement with consensus sequences for donor (5 1 ) and 

15 acceptor (3 1 ) splice sites. Negative numbers at the 

right end of each line refer to the number of base pairs 
upstream of the E75 A initiation site, positive numbers 
refer to positions in the E75 A mRNAs, continuing into 
the 3 f flanking DNA. Numbers at the left end of each 

20 line refer to amino acid residues in the E75 A protein. 

The underlined 14 bp sequence at -159 to -172 exhibits a 
13/14 bp match to a sequence (CGTAGCGGGTCTC) found 47 bp 
upstream of the ecdysone-inducible E74 A transcription 
unit responsible for the early puff at 74EF. This 

25 sequence represents the proximal part of a 19 bp sequence 

in the E74 A promoter that binds the protein encoded by 
the £. melanoaaster zeste gene. Another underlined 
sequence in the E75 A promoter at -74 to -82 is also 
found in the E75 B promoter, where it is part of a 

30 tandemly repeated octanucleotide (GAGAGAGC) located at 

-106 to -121 in B. This repeat matches the consensus 
sequence for the binding sites of the GAGA transcription 
factor which also binds to the E74 A promoter. Other 
underlined sequences represent, at -27 to -33, the best 

35 match to the TATA box consensus at an appropriate 

position, three AUG codons that are closely followed by 
in-frame stop codons in the 5» -leader sequence of the E75 
mRNAs, and alternative polyadenylation-cleavage signals 
at 4591 and 5365 that are used by both E75 A and E75B 

40 mRNAs. (B) The Bl exon and its 5 1 -flanking DNA. The 

numbering at the right and left ends of the lines follows 
the same convention as in A. Exons 2-5 shown in A are 
also used in E75 B, but the amino acid residues and base 
pair numbers shown in A must be increased by 157 and 375, 

45 respectively, to apply to the E75 B protein and mRNA. 

The first ten nucleotides of the 136-nucleotide E75 
B-intron linking the Bl exon to Exon 2 are gtaggttag, 
whereas the last ten are shown upstream of nucleotide 
1178 in A. The underlined sequences represent, in order, 

50 the region of homology to a sequence upstream of E75 A, 

noted above, the best match to the TATA box consensus 
at -21 to -27, and three AUG codons followed by in-frame 
stop codons in the 5' leader of the E75 B mRNA. 

Panels 1 and 2 are shown in detail in panels 3-8, 

55 and 9-10, respectively. 
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Methods 

cDNA libraries 

The ADm4925 and ADm4745 cDNAs were isolated from an 
5 0 r early pupal cDNA library in AgtlO (Poole, S. J., 

L. M. Kauvar # B. Drees, and T. Kornberg, 1985. The 
engrailed locus of Drosophila ; structural analysis of an 
embryonic transcript. Cell 40:37-40). The two cDNAs 
(ADm4927 and ADm4928) that were used for 3»-end mapping 
10 were isolated from an ecdysone-induced salivary gland 

cDNA library in A607 prepared by C. W. Jones. (Our 
strain collection names for the cDNA clones used in these 
studies are AfDm4925, AfDm4745, AeDm4927, and AeDm4928.) 

15 yortfreyn p\g% <m<qys;j.s 

Probes to be used for Northern blots were cloned 
into the vector ptpX (from R. Mulligan) , containing the 
0X174 origin of replication cloned in between the Hindlll 
and BamHI sites of pBR322. This allowed the synthesis of 

20 single-stranded probe DNA (Arai, K. , N. Arai, J. 

Schlomai, and A. Kornberg, 1980. Replication of duplex 
DNA of phage 0X174 reconstituted with purified enzymes. 
Proc. Natl. Acad. Sci. 77:3322-3326), which was performed 
by the incubation of .supercoiled plasmid DNA with gene A 

25 protein , rep and ssb proteins, and DNA polymerase III 

holoenzyme in a reaction containing 20 mM Tris CI 
(pH 7.5), 80 /xg/ml BSA, 4% glycerol, 20 mM DTT, 1 mM ATP, 
16 mM concentrations of the three unlabeled 
deoxynucleotides and 1.6 mM concentrations of the labeled 

30 deoxynucleotide for 1 hour at 30 °C. EDTA was then added 

to 20 mM, SDS to 0.1%, and proteinase K to 50 jug/ml. The 
reactions were digested for 30 minutes at 37 °C, and 
unincorporated label was removed by gel filtration. 

35 SI nuclease protection and primer extension analysis 

Single-stranded probes, prepared as described above 
by the <f>X in vitro replication system, were purified by 
electrophoresis on low melting point agarose gels for use 
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as SI probes. All other probes were prepared by 
extension of the -20/ 17-mer sequencing primer (New 
England Biolabs) on single-stranded M13mp (Messing, J. , 
1983. New M13 vectors for cloning. Methods Enzymol. 
101:20-78) or pEMBL (Dente, L. , G. Cesareni, and R. 
Cortes, 1983. pEMBL: A new family of single-stranded 
plasmids. Nucleic Acids Res. 11:1645-1654) recombinant 
templates using 32 P-labeled nucleotides, followed by 
cleavage with the appropriate restriction enzyme and 
purification of the probe on denaturing poly acryl amide 
gels. Labeled probe (100,000-300,000 cpm) was incubated 
with l tig of poly (A) + RNA in a 5 pi reaction mixture 
containing 5 /ig of yeast tRNA, 0.4 M NaCl, 40 mM PIPES 
(pH 6.8), and 1 mM EDTA at 60 *C under oil. Reactions 
were cooled and diluted 1:10 into either SI digestion or 
primer extension buffer. SI nuclease digestions were 
performed in 50 mM acetate buffer (Na) , 400 mM NaCl, and 
4 mM ZnS0 4 at 20 *C for 1 hour with -15 - 150 Vogt units of 
SI nuclease (Boehringer) per 50 Ail reaction. Primer 
extensions were performed at 42 # C in 50 mM Tris CI 
(pH 8,3 at 42 °C), 80 mM KC1, 2 mM DTT, 1 mM of dATP, 
dCTP, dGTP, and dTTP, with 20 units of AMV reverse 
transcriptase (Seikagaku) per 50 /il reaction. Reactions 
were terminated by the addition of EDTA, tRNA carrier was 
added to the SI nuclease digestions, and samples were 
ethanol-precipitated and either electrophoresed directly 
on 5% or 6% denaturing poly acryl amide gels or glyoxalated 
(McMaster, G. K. , and G. C. Carmichael, 1977. Analysis 
of single and double-stranded nucleic acids on 
polyacrylamide and agarose gels by using glyoxal and 
acridine orange. Proc. Natl. Acad. Sci. 74:4835-4838) 
and electrophoresed on 1% agarose gels run in 10 mM 
sodium phosphate buffer (pH 6.8). 

DNA sequence analysis 

The cDNA clones ADm4927 and ADm4928 were sequenced 
by chemical degradation (Maxam, A- M. , and W. Gilbert, 
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1980. Sequencing end-labeled DNA with base-specific 
chemical" cleavage . Methods Enzymol . 65 : 499-560) . All 
other sequencing was performed using the 
dideoxynucleotide chain termination method (Sanger, F. , 

A. R. Coulson, B. F. Barrell, A. J. H. Smith, and 

B. A. Roe, 1980. Cloning in single-stranded 
bacteriophage as an aid to rapid DNA sequencing. J. Mol. 
Biol. 143:161-178). Fragments were cloned into M13mp 
vectors (Messing, J., 1983. New M13 vectors for cloning. 
Methods Enzymol. 101:20-78) or pEMBL (Dente, L. , 

G. Cesareni, and R. Cortes, 1983. pEMBL: A new family of 
single-stranded plasmids. Nucleic Acids Res. 11:1645- 
1654) and sequenced directly or following the generation 
of a set of overlapping deletions using exonuclease III 
(Henikoff, S., 1984. Unidirectional digestion with 
exonuclease III creates targeted breakpoints for DNA 
sequencing. Gene 28:351-359). Sequencing was performed 
on both strands of the ADm4925 cDNA, the B-specific 
region of ADm4745 cDNA, the A- and B-specific 5 1 genomic 
regions not represented in the cDNAs, and the 3 1 -flanking 
region. The remaining exon boundaries of ADm4745 and 
genomic regions represented within the cDNA clones were 
sequenced on one strand. 

D. The E75 Gene Encodes Two Members of the Steroid 
Receptor Superfamily 

The coding and noncoding sequences of the E75 A and 
B mRNAs, their splice junctions, and the 5 f and 3* 
flanking sequences are shown in Table 1. Certain 
sequences of potential interest within the 5* flanking 
DNA and in the 5' leader mRNA sequences are indicated in 
the legend to Table 1. We focus here on the large open 
reading frames of the E75 A and B mRNAs that begin at 
380 bp and 284 bp downstream from their respective mRNA 
start sites, each continuing into the common final exon. 
The termination codon in exon 5 lies upstream of both 
alternative polyadenylation sites; thus, the sequence of 
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the encoded protein is not aff cted by which site is 
sel cted. Since the open reading frames in the E75 A and 
B mRNAs begin in the AO and Bl exons and merge at the 
beginning of exon 2, the proteins encoded by the two 
5 transcription units differ in the amino-terminal region 

and are the same in the carboxy-terminal region. The 
specific amino-terminal regions contain 266 and 423 amino 
acid residues in the E75 A and B proteins, respectively, 
while their common carboxy-terminal region consists of 

10 971 residues. The predicted molecular weights of the A 

and B proteins are thus 132,000 and 151,000. The open 
reading frames display characteristic D. melanoaaster 
codon usage, and their extents have been confirmed by in 
vitro translation of mRNAs transcribed in vitro from cDNA 

15 constructs and by expression of fusion proteins in 

E. coli. The predicted protein sequence for each protein 
is punctuated by homopolymeric tracts of amino acids, 
which are noted in Table 1 and its legend. 

Analysis of the sequences of E75 proteins and 

20 comparison to the sequences of known proteins have 

revealed similarity between the E75 proteins and members 
of the steroid receptor superfamily (Evans, R. M. , 1988. 
The steroid and thyroid hormone receptor superfamily. 
Science 240:889-895;- Green, S., and P. Chambon, 1988. 

25 Nuclear receptors enhance our understanding of 

transcription regulation. Trends in Genetics 4:309-314). 
We have used the nomenclature of Krust el al. (Krust, A., 
S. Green, P. Argos, V. Kumar, P. Walter, J. Bomert, and 
P. Chambon, 1986. The chicken oestrogen receptor 

30 sequence: Homology with v-erbA and the human oestrogen 

and glucocorticoid receptors. EMBO J, 5:891-897) in 
dividing the proteins into six regions, A to F, in the 
amino- to carboxy-terminal direction. 

Similarity between E75A and other members of this 

35 superfamily is strongest in the C region, a cysteine- 

lysine-arginine-rich region that is necessary and 
sufficient for the binding of these receptors to DNA (for 
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review, see , Evans, R. M. , 1988. The steroid and thyroid 
hormone ^receptor superfamily. - Science 240 : 889^-895 ; 
Green, S., and P. Chambon, 1988* Nuclear receptors 
enhance our understanding of transcription regulation, 
5 Trends in Genetics 4:309-314). The C region consists of 

66-68 amino acids, of which 20 residues are invariant 
within this family. Among these are nine invariant 
cysteine residues, eight are believed to coordinate zinc 
in the formation of two zinc finger-like structures 

10 (Miller, J., A. D. McLachlan, and A. Klug, 1985. 

Representative zinc-binding domains in the protein 
transcription factor IIIA from Xenopus oocytes. EMBO J. 
4:1609-1614; Freedman, L. P., B. F. Luisi, 2. R. Korszun, 
R. Basavappa, P. B. Sigler, and K. R. Yamamoto, 1988. 

15 The function and structure of the metal coordination 

sites within the glucocorticoid receptor DNA binding 
domain. Nature 334:543-546; Severne, Y. , S. Wieland, 
W. Schaffner, and S. Rusconi, 1988. Metal binding finger 
structure of the glucocorticoid receptor defined by site- 

20 directed mutagenesis. EMBO J. 9:2503-2508). Within the 

C region, E75A contains all of the highly conserved 
residues and is approximately as closely related to other 
members of the steroid receptor superfamily as they are 
to one another. The closest relative of E75 appears to 

25 be the human ear-1 gene, which has nearly 80% amino acid 

identity to E75 A in the DNA-binding domain. 

The other region conserved among members of the 
steroid receptor superfamily is the E region, which is 
required for steroid binding and for the linkage of 

30 steroid-binding and trans -activation functions (for 

review, see, Evans, R. M. , 1988. The steroid and thyroid 
hormone receptor superfamily. Science 240:889-895; 
Green, S., and P. Chambon, 1988. Nuclear receptors 
enhance our understanding of transcription regulation. 

35 Trends in Genetics 4:309-314). Although overall E-region 

similarity is clearly significant for the comparison of 
E75 A to the thyroid hormone, vitamin D, and retinoic 
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acid receptors , and ear-1, similarity to the 
_ glucocorticoid and estrogen receptors is considerably 
lower. However , the plots of local similarities show a 
clear similarity to each of these proteins within three 
5 subregions of the E region, denoted El, E2 and E3. The 

El subregion is the most highly conserved and corresponds 
to a region shown by in vitro mutagenesis to be essential 
for steroid binding and steroid-dependent trans - 
activation (Giguere, v., S. M. Hollenberg, 

10 M. G. Rosenfield, and R. M. Evans, 1986. Functional 

domains of the human glucocorticoid receptor. Cell 
46:645-652; Danielson, M. , J. P. Northrop, J. Jonklaas, 
and G. M. Ringold, 1987. Domains of the glucocorticoid 
receptor involved in specific and nonspecific 

15 deoxyribonucleic acid binding, hormone activation and 

transcriptional enhancement. Mol. Endocrinol. 1:816- 
822) . Region E2 is less highly conserved in primary 
amino acid sequence, but can, in part, be seen as a 
conserved hydrophobic region in the hydropathy plots of 

20 several of these proteins. A deletion of 14 amino acids 

within this region abolished steroid binding 
(Rusconi, S., and K. R. Yamamoto, 1987. Functional 
dissection of the hormone and DNA binding activities of 
the glucocorticoid receptor. EMBO J. 6:13 09-1315). E3 

25 falls close to the end of the region that is absolutely 

required for steroid binding. 

While the characteristic structural features of the 
steroid receptor superfamily are well conserved in E75, 
two novel variations are seen. The first of these 

30 concerns the structure of the E75 B protein, which 

contains a major alteration within its putative DNA- 
binding domain. The steroid receptor superfamily DNA- 
binding domain consists of two DNA-binding zinc fingers 
separated by a less conserved linker region. In E75, as 

35 in nearly all other genes of this family, an intron is 

found between the two fingers. In E75, this splice marks 
the beginning of the region held in common between the 
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E75 A and B proteins. This results in the E75 A protein 
having two fingers, while the E75-B- protein has unrelat d 
B-specific sequences in place of the first finger. Other 
sequences within the B-specific amino-proximal region may 
5 contribute to the DNA-binding domain of the E74B protein. 

Alternatively, the B protein might bind DNA with 
only one finger, as GAL4 transcription factor of yeast 
appears to do. It is possible that these structural 
differences imply a functional difference in the DNA- 

10 binding properties of the E75 A and B proteins that might 

allow them to differentially regulate the transcription 
of the late genes that characterize the secondary 
response to ecdysone in different target tissues. 

In this respect, it should be emphasized that the 

15 putative hormone- or ligand-binding domain is represented 

by the E region that is common to the E75A and E75B 
proteins. Thus, these proteins appear to be receptors 
for the same hormone, which may regulate the 
transcription of different sets of genes. These proteins 

20 represent "orphan" receptors, in that their hormone, or 

binding ligand, has not yet been identified. Because 
ecdysteroids are the only known steroid hormones in 
Drosophila , the most obvious candidate for an E75 ligand 
would be ecdysone itself. However, it is unlikely that 

25 this is the case, since the putative hormone-binding 

domain of the E75 proteins does not exhibit the high 
sequence homology to that of the known Drosophila 
ecdysone receptor encoded by the EcR gene (see 
Experimental Example III and Table 2) that would be 

30 expected if the E75 proteins were also ecdysone 

receptors. It, therefore, seems likely that the E75 
proteins would bind either a terpenoid juvenile hormone 
or a novel Drosophila hormone. 

The second unusual feature of the E75 proteins is 

35 the presence of a large F region, encompassing nearly one 

half of the proteins. Many of the other receptors have 



WO 91/13167 



PCT/US91/01189 



56 

very small F regions, and no function has yet been 
ascribed t this region. 

Methods 

Protein sequence analysis 

Sequence data were compiled using the Bionet system. 
Protein sequence comparison was performed using FASTP 
(Lipman, D. J. , and W. R. Pearson, 1985. Rapid and 
sensitive protein similarity searches. Science 227:1435- 
1441) and Bionet IFIND programs. 

E. Expression Vectors for E75 Proteins 
In order to express the E75 proteins, portions of 
cDNAs and genomic clones were fused in order to generate 
cassettes containing the entire E75 A and E75 B protein 
coding regions. First, BamHI sites were introduced into 
genomic clones upstream of the initial AUGs of the large 
open reading frames. Then, E75 AO exon sequences were 
fused to sequences of a nearly full-length E75 A cDNA, 
and E75 Bl exon sequences were fused to sequences of a 
nearly full-length E75 B cDNA. These cassettes were 
cloned into pGEM3 (Promega) , and transcripts of the open 
reading frames were prepared using T7 polymerase. These 
were then translated in the presence of 35 S-methionine, 
and shown to give rise to proteins of appropriate size. 

These cassettes have been placed into a variety of 
expression vectors, including pUCHsneo/Act for expression 
in Drosophila cells, pSV2 for expression in mammalian 
cells, and pOTS for expression in bacterial cells. 

Methods 

BamHI sites were introduced directly upstream of the 
initial ATGs of the E75A and E75B coding sequence — at 
the Sspl site upstream of the E75A initial ATG, and at 
the SacII site upstream of the E75B initial ATG. cDNA 
and genomic sequences were joined at the EcoRV site in 
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the AO exon to construct an E75A cassette , and at the 
Mlul in exon 3 to construct an E2§B cassette. 

EXAMPLE 1J. 

CLONING, STRUCTURE AND EXPRESSION OF THE EcR AND 
DHR3 GENES THAT ENCODE ADDITIONAL MEMBERS OF THE 
STEROID RECEPTOR SUPERFAMILY . 

The following experiments were carried out after the 
primary structure of the E75 gene, and of the two members 
of the steroid receptor superfamily that it encodes, was 
determined (Experimental, Example I). The purpose of 
these experiments was to clone and determine the primary 
structure of other steroid receptor superfamily genes 
from Drosophila . and of the proteins they encode. The 
aim was to identify the gene that encodes a Drosophila 
ecdysone receptor, given that the characteristics of the 
E75 gene indicated that it did not encode an ecdysone 
receptor. The first stage of the experimental plan was 
to use the conserved sequences in the E75A transcription 
unit that encode the putative DNA-binding domain of the 
E75A receptor protein as a probe to screen a Drosophila 
genomic library to identify sequences encoding the 
putative DNA-binding domains of other Drosophila members 
of the steroid receptor superfamily. The second stage 
was to isolate cDNA clones corresponding to the 
identified genes, as well as additional genomic DNA 
clones, to obtain the nucleotide sequence of the complete 
coding region (i.e., the open reading frame encoding the 
respective receptors) and the exon-intron organization of 
these genes. 

The experiments described below resulted in the 
cloning and structural characterization of two genes that 
satisfy the criteria for bona fide members of the steroid 
receptor superfamily: encoding proteins that exhibit 
amino acid sequence homology to both the DNA-binding and 
the hormone-binding domains that are conserved among 
members of this superfamily. The two genes are called 
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EcR and DHR3 . The EcR gene was originally called DHR23 , 

~but^was^renamed~E^^after^it:~was~"shown^to~^encode^an 

ecdysone receptor (see Experimental Example III) . The 
DHR3 designation stands for Drosophila Hormone 
Receptor 3. 

A. Identification and Chromosomal Mapping of EcR 
and DHR3 Genomic Clones 

Initially f Southern blots of total Drosophila 
genomic DNA, digested with one or more restriction 
endonucleases r were probed with a 530 bp fragment of the 
E75A cDNA containing the sequences encoding the putative 
DNA-binding domain of the E75A receptor protein (see 
Experimental Example I) at low and high stringency 
hybridization conditions. 

To isolate the sequences responsible for these low 
stringency bands, this E75A probe was used to screen a 
Drosophila genomic library under the same low stringency 
conditions, counterscreening duplicate filters with E75 
intron probes to eliminate phage containing inserts from 
the E75 gene. Five genome equivalents were screened and 
39 non-E75 containing phage were isolated. The 25 most 
strongly hybridizing clones were divided into six classes 
on the basis of restriction patterns and cross 
hybridization, each class containing between one and six 
independent overlapping genomic inserts. 

For each class, a restriction fragment containing 
the region of hybridization to the E75A probe was 
localized by Southern blotting. Hybridization of probes 
derived from these fragments to genomic Southern blots 
showed that each of the low stringency bands detectable 
by the E75A probe could be accounted for by one of the 
six isolated fragments. 

The nucleotide sequences of the six restriction 
fragments were determined to test whether they represent 
candidate receptor genes. In all cases, DMA sequence 
similarities with the E75A probe were observed that are 
sufficient to account for the hybridization of these 
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fragments with the probe. When the DNA sequences were 
conceptual ly translated ixT~a3T~s i^rea~ding~f ramesT~f oW 
of the fragments yielded no significant sequence 
similarity with E75A at the protein level. The remaining 
two clones , however, showed predicted amino acid 
sequences with strong similarity to the DNA binding 
domains of the E75A protein and other steroid superfamily 
receptors . 

These two clones represent the EcR and DHR3 genes, 
as will become apparent. Probes from these clones were 
used to map the position of these genes in the polytene 
chromosomes by in situ hybridization. The EcR and DHR3 
chromosomal loci were mapped to positions 42A and 46F, 
respectively, in the right arm of the second chromosome. 

B. Structure of the EcR and DHR3 Genes and Their 
cDHAs 

The DHR3 and EcR genomic clones described above were 
used to screen a cDNA library prepared from third instar 
tissues treated with ecdysone and cycloheximide. This 
procedure allowed the isolation of a large number of cDNA 
clones, since both genes have a peak period of 
transcription in late third instar after the rise in 
ecdysone titer. For each gene, 20 cloned cDNAs were 
purified and their lengths determined. Restriction maps 
for the 10 longest cDNAs from each gene were determined 
and found to be colinear. 

For EcR, a 5534 bp cDNA sequence was obtained from 
two overlapping cDNA clones. It contains an 878 codon 
open reading frame (ORF) which yields a predicted amino 
acid sequence expected for a member of the steroid 
receptor superfamily (Table 2) , as described in more 
detail below. The length of the largest DHR3 cDNA that 
was isolated (clone DHR3-9) is 4.2 kb. The nucleotide 
sequence of this cDNA was determined and found to contain 
a 487 codon AUG-initiated open reading fram (Table 3). 
As described below, the amino acid sequence of the DHR3 
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protein predicted from this sequence demonstrates that 

this-prot in is-also a bona~f ide^ member of the steroid 

receptor superfamily. 
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Table-2. Th -cDNA sequenc ..of-the^EcR gene. 

Numerals at the left refer to the nucleotide sequences; 
those on the right to the amino acid sequence in the EcR 
protein. Nucleotides 1-5194 are the sequence of EcR-17 
cDNA, while nucleotides 5195-5534 derive from the EcR-9 
cDNA. The underlined sequences in the 5 1 and 3 1 
untranslated regions refer , respectively, to the ATG 
codons and the AATAAA consensus polyadenylation signals. 
Positions of the introns and the donor and acceptor 
splice sequences are indicated above the cDNA sequence in 
small type. The amino acid sequences homologous to the 
conceived DNA-binding (C region) and hormone-binding (E 
region) domains of the steroid receptor superfamily are 
underlined. 

Panel 1 is shown in detail in panels 2 and 3. 



WO 91/13167 



PCT/US91/01189 



62 

Table 3. The cDNA sequence of the DHR3 gene. 
The numbering and underlining of the nucleotide and amino 
acid sequences hav the same meaning as in Table 2, and 
the intron positions and donor and acceptor splice 
5 sequences are similarly indicated • The sequence of the 

5 1 proximal 2338 nucleotides of the DHR3-9 cDNA is shown. 
The sequence of the remainder of this 4.2 kb cDNA was 
determined for only one strand and is not shown. Four 
silent, third-position differences between the cDNA and 
10 genomic DNA sequences are indicated above the cDNA 

sequence. 
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The genomic structur of the EcR and DHR3 genes 
was investigated by isolating additional genomic DNA 
clones that form overlapping sets that contain all of the 
sequences found in the respective cDNA clones. The exons 
contained in these cDNAs were mapped within the genomic 
DNA by comparison of cDNA and genomic clones via Southern 
blot analysis, mapping of restriction cleavage sites, and 
finally, by determination of the nucleotide sequence of 
the genomic DNA in regions that contain the exon/intron 
boundaries. Table 2 and 3 show these boundaries and the 
sequence of the splice junctions for the EcR and DHR3 
genes, respectively. All of these splice junctions 
conform to the splice donor and acceptor consensus 
sequences • 

For EcR . the cDNA sequence shown in Table 2 is split 
into six exons spread over 36 kb of genomic DNA, with the 
ORF beginning in the second exon and ending in the sixth. 
For DHR3, the cDNA sequence derives from nine exons 
spread over 18 kb, with the ORF beginning in the first 
exon and ending in the ninth. Because the 5 1 and 3' ends 
of the respective mRNAs were not mapped, it should be 
emphasized that these genes may have additional noncoding 
exons at their 5 1 or 3 1 ends . 

The EcR and DHR3 gene structures differ 
significantly from those of all previously examined 
steroid receptor superfamily genes. Comparison with the 
genes for 11 other receptor homologues for which at least 
partial structural information is available reveals that 
the positions of certain exon boundaries have been 
conserved in evolution • This conservation is most 
striking in the portion of the genes encoding DNA-binding 
domains. In the nine other cases where the structure of 
this region has been examined, the two halves of the DNA- 
binding domain are always encoded by separate exons. If 
we exclude the Drosophila genes knirps . knirps-related , 
and eaon (which are not bona fide receptor homologues 
since they lack the hormone-binding domain sequence 
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similarity) , these are always small exons, the second one 

invariably-ending-in-the-fourth-codonbeyond-the 

conserved Met codon at the end of the c region. Thus, 
these exons each encode one of the two predicted zinc 
fingers of the DNA-binding domain. In contrast, both 
zinc fingers of the putative DNA-binding domain of the 
EcR and DHR3 receptors are encoded by a single exon. It 
is possible that our screen specifically selected for 
genes lacking the above intron. The screen selected 
genomic clones that hybridize to an E75A cDNA probe that, 
of course, lacks this intron. Genomic sequences 
containing a contiguous sequence encoding the DNA-binding 
domain would be expected to hybridize to this probe 
better than clones from genes containing the intron. 
This would explain the successful isolation of the EcR 
and DHR3 genes, and the failure to isolate the genes of 
other DyosophjLla members of the steroid receptor 
superfamily. 

Methods 

Isolation o f cDNA and additional genomic clones 

Subclones of the originally isolated DHR3 and EcR 
genomic clones were used to screen a cDNA library 
prepared from third instar tissues treated with ecdysone 
and cycloheximide. This library was chosen because both 
genes are relatively highly expressed at the end of third 
instar, and because of the high quality of the library. 
Of the 270,000 primary plaques screened, 20 positives for 
DHR3 and 220 for EcR were detected. Twenty cDNAs for 
each gene were purified, of which the ten largest for 
each were restriction mapped and found to be colinear. 
cDNA DHR3-9, which extends further in both the 5 1 and 3' 
directions than our other DHR3 cDNAs , was chosen for 
sequencing. For EcR . the longest cDNA, EcR-17, extended 
the farthest 5 1 and was sequenced in its entirety. An 
additional cDNA clone, EcR-9, was found to extend 300 bp 
farther 3' than EcR-17, and this 3 1 extension was also 
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sequenced. Additional genomic DNA clones covering the 
EcR and~ DHR3 genes wer obtaihed~by screening the ~ 
Drosophila Canton S genomic library referred to in part A 
above, either with probes from the respective cDNA 
5 clones, or, for overlapping clones, by the chromosomal 

walk method described in Experimental Example I. 

PNA sequence epalysjs 

cDNAs were subcloned into BlueScript vectors 
10 (Stratagene) , and clones for sequencing were generated by 

exonuclease III digestion (Henikoff, S., 1984. 
Unidirectional digestion with exonuclease III creates 
targeted breakpoints for DNA sequencing. Gene 
28:351-359) . 

15 Double-stranded plasmids were denatured (Gatermann, 

K. B., G. H. Rosenberg, and N. F. Kaufer, 1988. Double- 
stranded sequencing, using mini -prep plasmids, in 11 
hours. BioTechnioues 6:951-952) and sequenced by the 
dideoxy chain terminating method (Sanger, F. , S. Nicklen, 

20 and A. R. Coulson, 1977. DNA sequencing with chain- 

terminating inhibitors. Proc. Natl. Acad. Sci. USA 
74:5463-5467), using the enzyme Sequenase (U.S. 
Biochemical) • cDNA EcR-17 was completely sequenced on 
both strands, as was the EcR-9 3 1 extension. cDNA DHR3- 

25 9 was sequenced on both strands for the 5" most 2338 bp, 

which contains the entire ORF, and the remainder of the 
long 3 1 untranslated region was sequenced on one strand. 

The exon/intron boundaries in genomic DNA clones 
were first mapped at low resolution by Southern blot 

30 analysis of their restriction fragments probed with 

labeled cDNAs. Genomic DNA surrounding each exon/intron 
boundary was subcloned and the nucleotide sequence of 
these subclones determined as above. 

Genomic exons were either sequenced entirely, or for 

35 the longer exons, were digested and electrophoresed in 

parallel with cDNA clones to confirm the colinearity of 
the genomic and cDNA clones. Shorter exons were 
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completely sequenced from genomic clones. Longer exons 
had their" boundaries sequenced from genomic clones/ and 
were confirmed to be colinear with the cDNA clones by 
parallel digestion and electrophoresis of the cDNA and 
genomic clones. 

C. The Predicted Amino Acid Sequence of the EcR 
and DHR3 Proteins and their Implic ations 

Comparison of the predicted EcR and DHR3 protein 
sequences to the sequence database and to individual 
members of the steroid receptor superf amily shows that 
these proteins share the two conserved domains 
characteristic of this superf amily (Evans , R. *M. , 1988. 
The steroid and thyroid hormone receptor superf amily. 
Science 240:889-895; Green, S., and P. Chambon, 1988. 
Nuclear receptors enhance our understanding of 
transcription regulation . Trends in Genetics 4:309-314). 
We refer to the domains as the C and E regions, for the 
more amino-terminal and more carboxy-terminal homologies, 
respectively, according to the nomenclature of Krust et 
al. (Krust, A., S. Green, P. Argos, V. Kumar, P. Walter, 
J. M. Bornert, and P. Chambon, 1986. The chicken 
oestrogen receptor sequence; homology with v-erbA and the 
human oestrogen and glucocorticoid receptors. EMBO J. 
5:891-897). These domains are underlined in Tables 2 and 
3, and Table 4A-C presents a comparison of these domains 
from EcR and DHR3 with those from representative members 
of the super family. 
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Table"4 . ~Sequence~comparison~of the~conserved - 
C and E regions in DHR3 , EcR, and some representative 
nuclear receptor homologies. (A) C-region alignment. 
Numbers at the left indicate the amino acid positions 
within the individual receptors; dashes indicate gaps 
introduced to obtain maximal alignment. Dots indicate 
three positions important in determining the DNA binding 
specificity of this domain. (B) E-region alignment. 
Bars indicate the three most highly conserved stretches 
within this domain. (C) Computed percent identities 
among the C-region sequences (lower left) and among the 
E-region sequences (upper right) . The kni sequence shows 
no significant E-region homology and is, therefore, not 
included in this comparison. Sequences shown are from: 
E75A, Drosophila ecdysone-inducible gene at 75B; kni/ 
Drosophila segmentation gene knirps : hRARa, human 
retinoic acid receptor alpha; htRB, human thyroid 
receptor beta; hVDR, human vitamin D receptor; cOUP-TF, 
chicken ovalbumin upstream promoter transcription factor; 
hERRl and hERR2, human estrogen-related receptors 1 and 
2; hER, human estrogen receptor; hGR, human 
glucocorticoid receptor; hMR, human mineralocorticoid 
receptor; hGR, human progesterone receptor. 

Panel 1 is shown in detail in panels 2-6. 
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The C region is a 66-68 amino acid domain that 
has been shown to function as a zinc finger- DNA binding 
domain in vertebrate receptors. This domain has also 
been implicated in receptor dimerization ( Kumar , V. , and 
P. Chambon, 1988. The estrogen receptor binds tightly to 
its responsive element as a ligand- induced homodimer. 
Cell 55:145-156). As shown in Table 4A, all 19 C-region 
residues that are absolutely conserved in the other 
receptor homologues are also conserved in DHR3 and EcR, 
including the nine invariant Cys residues, eight of which 
coordinate two zinc ions (Freedman, L. P., B. F. Luisi, 
Z. R. Korszun, R. Basavappa, P. B. Sigler, and 
K. R. Yamamoto, 1988. The function and structure of the 
metal coordination sites within the glucocorticoid 
receptor DNA binding domain. Nature 334:543-546). As 
seen in Table 4C, the Drosonhila C-region sequences 
(including those of E75A) are not more closely related to 
each other than they are to those from the vertebrate 
receptor homologues. The C region of DHR3 is most 
similar to that of the human retinoic acid receptor a 
(hRARa) , and the C region of EcR is most similar to that 
of the human thyroid receptor fi (hTRB) . Studies on the 
human glucocorticoid receptor (hGR) and human estrogen 
receptor (hER) have identified three C-region residues 
(indicated by dots in Table 4A) that are critical for 
determining the differential DNA binding specificity of 
these receptors (Mader, S., V. Kumar, H. de Verneuil, and 
P. Chambon, 1989. Three amino acids of the estrogen 
receptor are essential to its ability to distinguish an 
estrogen from a glucocorticoid-responsive element. 
Nature 338:271-274; Umesono, K. , and R. M. Evans f 1989. 
Determinants of target gene specificity for 
steroid/thyroid hormone receptors. Cell 57:1139-46). 
The three Drosoohila proteins DHR3 , EcR, and E75A, as 
well as the vertebrate receptors hRARa, hTRB, and the 
human vitamin D receptor (hVDR) , all have identical amino 
acids at these three positions; thus, these proteins may 
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all have similar DNA binding specificities, as has 
already been shown for hRARa and liTEB (Umesono, K. , 
V. Giguere, C. K. Glass , M. G. Rosenfeld, and 
R. M. Evans, 1988. Retinoic acid and thyroid hormone 
induce gene expression through a common responsive 
element. Nature 336:262-265). 

The E-region is an -225 amino acid domain that 
functions as a hormone-binding domain in vertebrate 
receptors. This domain has also been implicated in 
hormone dependent receptor dimerization (Kumar, V. and 
P. Chambon, 1988. The estrogen receptor binds tightly to 
its responsive element as a ligand- induced homodimer. 
Cell 55:145-156; Guiochon, M. A., H. Loosfelt, P. Lescop, 
S. Sar, M. Atger, A. M. Perrot, and E. Milgrom, 1989. 
Mechanisms of nuclear localization of the progesterone 
receptor: evidence for interaction between monomers. 
Cell 57:1147-1154), hormone dependent nuclear 
localization of the glucocorticoid receptor (Picard, D. , 
and K. R. Yamamoto, 1987. Two signals mediate hormone- 
dependent nuclear localization of the glucocorticoid 
receptor. EMBO J. 6:3333-3340), and binding of the 
glucocorticoid receptor to the 90 kDa heat shock protein 
(Pratt, W. B., D. J. Jolly, D. V. Pratt, 
W. M. Hollenberg, V. Giguere, F. M. Cadepond, 
G. G. Schweizer, M. G. Catelli, R. M. Evans, and 
E. E. Baulieu, 1988. A region in the steroid binding 
domain determines formation of the non-DNA-binding, 9 S 
glucocorticoid receptor complex. J. Biol. Chem. 263:267- 
273) . Table 4B shows an alignment of the E regions of 
the DHR3 and EcR proteins with those of other receptor 
homologues. The three relatively highly conserved 
stretches within this region noted in Experimental 
Example I are overlined; each contains a cluster of 
residues conserved in all or most of the receptor 
sequences. DHR3 and EcR show strong similarity to each 
other and to the other proteins in these stretches, and a 
lower similarity outside of them. The presence of this 
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E-region homology establishes these proteins as bona fide 
members of the nuclear receptor f amily , in contrast to 
the Drosophila, knirps (Nauber, U. , M. J. Pankratz, 
A. Kienlin, E. Seifert, U. Klemm f and H. Jackie, 1988. 
Abdominal segmentation of the Drosophila embryo requires 
a hormone receptor-like protein encoded by the gap gene 
ImiEES. Nature 336:489-492), kniros-related (Oro, A. E., 
E. S. Ong, J . S. Margolis, J. W. Posakony, M. McKeown, 
and R. M. Evans, 1988. The Drosophila gene knirps- 
related is a member of the steroid-receptor gene 
superfamily. Nature 336:493-496), and eaon (Rothe, M. , 
U. Nauber, and H. Jackie, 1989. Three hormone receptor- 
like Drosophila genes encode an identical DNA-binding 
finger. EMBO J. 8:3087-3094) proteins, which show C- 
region homology but no E-region homology. The E region 
in DHR3 is most similar to that of E75A, and the E region 
of EcR is most similar to that of hTRfi, although the 
level of these similarities is lower than those found 
among E regions of many other receptors (Table 4C) . 
Thus, DHR3 and EcR are not especially close homologues of 
any previously cloned receptors. Comparison of E-region 
sequences allows division of the nuclear receptors into 
subfamilies (Petkovich, M. , N. J. Brand, A. Krust, and 
P. Chambon, 1987. A human retinoic acid receptor which 
belongs to the family of nuclear receptors. Nature 
330:444-450), the members of any one subfamily being more 
related to each other than to those in other subfamilies. 
The DHR3 and EcR receptors fall into a subfamily with the 
E75A, E75B, hRARa, hTRB, and hVDR receptors. 

D. In Situ Labeling of the EcR and DHR3 Proteins 

with Antibodies Induced by Proteins Produced in 
E- coli _ 

To determine the intracellular and tissue 
distribution of the EcR and DHR3 proteins in Drosophila , 
affinity-purified polyclonal antibodies directed against 
those proteins were produced in the following manner. 
The region of about 120 amino acid residues located 



WO 91/13167 



PCT/US91/01189 



71 

between the conserved DNA-binding and hormone-binding 

-domains- of-these-proteins-was~used- as the~immunogen~to 

produce antibodies against each protein. Thus, the 
coding sequences for amino acids 335-447 of the EcR 
protein and for amino acids 164-289 of the DHR3 protein 
(see Tables 2 and 3, respectively) were cloned into the 
appropriate pATH (Dieckmann, C, and A. Tzagaloff, 1985. 
J. Biol. Chem. 260:1513-1520) or pUR expression vectors, 
so as to fuse these coding sequences to those encoding £• 
coli fi-galactosidase (figal) or to E. coli tryptophan E 
protein (trpE) , respectively. 

The Bgal fusion proteins were produced in £. coli by 
the addition of the IPTG inducer to exponential cultures, 
while the production of trpE fusion proteins were induced 
by dilution into tryptophan-free media and subsequent 
addition of indoleacetic acid. For EcR, the trpE fusion 
protein was used as an immunogen and the Bgal fusion 
protein was used on immunoblots to test sera for 
immunoreactivity to the EcR portion of the fusions. For 
DHR3, the figal fusion protein was injected, and sera were 
checked against the trpE fusion protein. 

For immunization the appropriate fusion protein was 
prepared by electrophoresis on SDS-PAGE gels and 
visualized by staining in ice-cold 0.25 M KC1, after 
which the fusion protein band was cut out. Approximately 
100 fxg of fusion protein in 0.25 ml of gel slice was 
crushed by passing through successively smaller 
hypodermic needles, and mixed with 0.25 ml of a sterile 
saline solution and 0.5 ml of Freund*s complete adjuvant. 
For each immunogen, two New Zealand White rabbits were 
injected at multiple intramuscular sites, and after one 
month, boosted at two-week intervals, omitting the 
Freund's adjuvant. While the Bgal fusion proteins were 
subject to the above gel electrophoresis without prior 
purificati n, the trpE fusion proteins were first 
purified by the following method which takes advantage of 
their insolubility in vivo . 
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£. coli from a two-liter culture of induced cells 
were washed, and the cell pellet was subjected to several 
freeze/thaw cycles. The cells were resuspended in 18 ml 
of 50 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, and 1.8 ml of 10 
mg/ml lysozyme was added. After 15 minutes on ice, the 
cells were lysed by passing three times through a french 
pressure cell at 10,000 psi. The insoluble fraction was 
collected by centrifugation at 27,000 x g for 15 minutes, 
and washed by resuspension, using a Dounce homogenizer, 
in ice-cold 50 mM Tris-HCl, 0.5 mM EDTA, 0.3 M NaCl, 
followed by centrifugation as above. The washing step 
was repeated, and the final pellet dissolved in 10 ml of 
4M urea, 2% (w/v) SDS, 50 mM Tris-HCl, pH 7.5, 1 mM EDTA, 
5% (v/v) 2-mercaptoethanol. Material remaining insoluble 
was centrifuged out and discarded. 

The antisera was affinity purified in a two-step 
procedure by successive passage through "nonspecific" and 
"specific" affinity columns. In the case of antibodies 
raised against the trpE fusion proteins, the nonspecific 
column consisted of resin coupled to the insoluble 
protein derived from E. coli expressing unmodified trpE 
protein, and was used to remove antibodies directed 
against trpE epitopes, as well as against insoluble E. 
coli protein impurities. The specific column consisted 
of resin coupled to the EcR-trpE fusion protein (purified 
as described above) and was used to absorb the desired 
antibodies directed against the EcR epitopes, antibodies 
that were subseguently released from the column. In the 
case of antibodies raised against the Bgal fusion 
proteins, the same general procedure was used, except 
that the resin in the nonspecific column was coupled to 
B-galactosidase, while that in the specific column was 
coupled to the DHR3-Bgal fusion protein. Western blot 
analysis of the appropriate fi. coli extracts demonstrated 
that these affinity-purified antibodies exhibited the 
desired specificity. 
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The intracellular distribution of the EcR protein in 
late third-instar-salivary-glands-was -examined -by~An-si£n 
labeling of this protein with the anti-EcR antibody. The 
EcR protein was thereby shown to be highly localized in 
the nuclei of these glands. Indeed, when the polytene 
chromosomes in these nuclei were examined by the 
antibody-labeling method of Zink and Paro (Zinc, B., and 
R. Paro, 1989. Nature 337:468-471), specific loci within 
these chromosomes exhibited strong binding of the EcR 
protein. In particular, the EcR protein was bound to the 
early puff loci, including those occupied by the E75 and 
E74 genes. This is the result expected if the ecdysone 
receptor encoded by the EcR gene is that which induces 
the transcription of the early genes, as anticipated by 
the Ashburner model. Another prediction of the Ashbumer 
model is that the ecdysone-receptor complex initially 
represses the genes responsible for the later puff, so 
that the transcription of the late genes induced by the 
early gene proteins is delayed until these proteins 
accumulate sufficiently to overcome this initial 
repression. If the EcR receptor is involved in this 
postulated initial repression, then one would expect the 
EcR protein to bind to the late puff loci in the salivary 
glands. This expectation was met by the observation that 
EcR protein also binds to the late puff loci in the 
polytene chromosomes. 

Additional in situ antibody labeling experiments 
demonstrated that the EcR protein is present in the 
nuclei of all ecdysone target tissues examined in late 
third instar larvae. It is also present in most, if not 
all, cells during embryogenesis and other stages of 
Drosophila development that have been examined. In this 
respect, the EcR protein was not detected by anti-EcR 
antibody labeling of embryos in which the EcR gene was 
eliminated by a chromosomal del tion, further 
d monstrating the specificity of this antibody. 
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In contrast to the widespread distribution of the 
EcR protein, anti-DHR3 antibody labeling of embryos 
demonstrated that the distribution of the DHR3 protein is 
highly restricted during this stage of development. 
During the brief embryonic period of expression, the 
protein is restricted to the peripheral nervous system, 
and to cells surrounding the spiracles at the posterior 
end of the embryo. 

Finally, it should be noted that affinity-purified 
antibodies against the E75A protein have also been 
prepared by the same technique described above for anti- 
EcR and anti-DHR3 antibodies. In situ antibody labeling 
of the E75A protein in larval salivary glands has also 
demonstrated that this protein is localized in the 
nucleus and is bound to specific loci in the polytene 
chromosomes. 

EXAMPLE III 

THE ECDYSTEROID-BINDING, DNA- BINDING AND GENETIC 
REGULATORY PROPERTIES OF THE EcR PROTEIN DEMONSTRATE 
THAT IT IS AN ECDYSONE RECEPTOR. 

The following experiments demonstrate that the 
protein encoded by the EcR gene is an ecdysone receptor 
by the following three criteria. (l) The EcR protein 
binds ecdysteroids and accounts for a large proportion, 
if not all, of the ecdysteroid-binding activity present 
in Drosophila embryos and in a variety of cultured 
Drosophila cells. (2) The EcR protein binds with high 
specificity to a DNA sequence that functions as an 
ecdysone response element (EcRE) , i.e., an enhancer that 
confers ecdysone inducibility to a promoter. (3) Cells 
that do not respond to ecdysone because they lack 
functional ecdysone receptors are transformed to the 
ecdysone-responsive state by transfection with an EcR 
expression plasmid. 



A. The EcR Protein Binds Ecdvstaroids 
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The EcR xpression plasmid, pMTEcR, shown in 
Figur 1 contains" the open reading frame encoding the EcR 
protein ( EcR ORF; see Experimental Example II) fused to 
the Drosophila metallothionine promoter (Pmt) at its 5 1 
5 end, and the polyadenylation-cleavage sequences of the 

Drosophila Actin 5C gene at its 3 1 end. Because 
transcription of the EcR ORF is under the control of this 
metallothionine, that transcription is induced by Cu 2+ ion 
to yield an mRNA that, in turn, produces the EcR protein. 

10 A cell line, MtEcRHy, that overproduces this protein upon 

Cu 2+ induction, as determined by Western blot analysis 
using the affinity-purified anti-EcR antibody (see 
Experimental Example II) , was constructed by the stable 
integration of the pMTEcR plasmid DNA into the genome of 

15 Drosophila Sch-2 cell line. A control cell line, MtHy, 

was similarly constructed by the integration of the 
expression vector DNA lacking the EcR ORF. 

Whole cell extracts were prepared from both the 
MtEcRHy and MtHy cell lines after Cu 2+ induction, and were 

20 assayed for ecdystero id-binding activity using the high 

affinity ecdysone analogue [ 125 I] iodoponasterone A. The 
MtEcRHy extract contained sevenfold more saturable 
ecdysteroid-binding activity than the MtHy control 
extract . 

25 To see if the induced ecdysteroid-binding activity 

was due to the EcR polypeptide itself, the EcR protein 
was depleted from the MtEcRHy extract by 
immunoprecipitation using an affinity-purified anti-EcR 
polyclonal antibody, or, as a control, the extract was 

30 mock-depleted with preimmune serum. The treated extracts 

were then assayed for ecdysteroid-binding activity. 
Comparison of the immuno-depleted extract with the mock- 
depleted extract showed that most of the binding activity 
was removed by the anti-EcR antibody treatment, 

35 indicating that the indue d ecdysteroid-binding activity 

results from the EcR protein. 
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The endogenous ecdysteroid-binding activity in the 
control cell line, MtHy, was unchamged by Cu 2+ exposure, 
and was approximately the same as that in the Sch-2 cell 
from which it derives. The question arises as to whether 
the endogenous activity in these and other Drosophila 
cell lines, as well as in embryonic extracts, results 
from the expression of the EcR gene in their respective 
genomes. To answer this question, extracts from embryos 
and several cell lines were immuno-depleted and mock- 
depleted, as described above, and assayed for 
ecdysteroid-binding activity. Again, comparison of these 
treated extracts showed that the large majority of the 
endogenous binding activity was removed in each case by 
treatment with the anti-EcR antibody. Thus, it appears 
that most, if not all, of the endogenous binding activity 
in embryos and cell lines results form the resident EcR 
gene. 

Methods 
Extracts 

Tissue culture cell extracts for hormone and DNA- 
binding experiments were prepared as follows. Cells were 
grown in spinner flasks to a density of 5-7xl0 6 cells/ml, 
and were washed once in EcR buffer (25 mM Hepes, pH 7.0, 
40 mM KC1, 10% (v/v) glycerol, 1 mM EDTA, 1 mM 
dithiothreitol , and the following cocktail of protease 
inhibitors: 10 mM Na 2 S 2 0 5 , 500 /aM PMSF, 1 (M leupeptin, 
1 mM pepstatin) . All further manipulations were at 4*C. 
Cells were resuspended in EcR buffer at 2% of the 
original culture volume, divided into 3 ml aliquots, and 
sonicated using 30 1/2 second pulses with a probe 
sonicator (Bronson Sonif ier 450) , resulting in 
disruptions of -95% of the cells. After centrifugation 
at 100,000 x g for 1 hour, 100 ill aliquots of supernatant 
were frozen in liquid nitrogen, and stored at -80 'C. 
Protein concentration was determined using bovine serum 
albumin as the standard, and was typically 6-11 mg/ml. 
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Embryo extracts were prepared by a similar protocol: 

T^6HF6uir^<^ 

commercial bleach for 2 minutes, washed extensively in 
0.7% NaCl, and resuspended using 2 grams of embryos per 
ml of EcR buffer. Embryos were broken with 20 strokes in 
a Dounce homogenizer using a B pestle, and lysis was 
completed with the probe sonicator using the same 
settings as used for the tissue culture cells. The 
extract was adjusted to 400 mM KC1, centrifuged 1 hour at 
100 , 000 x g, and aliquots of supernatant were frozen. 
This extract contained 13.4 mg/ml protein. Before use in 
hormone binding, it was diluted tenfold in EcR buffer 
lacking KC1 to bring the final KC1 concentration to 
40 mM. 

flQrmone-bindinq assays 

For hormone-binding experiments, extracts were first 
diluted to the following concentrations in EcR buffer: 
0.9 mg/ml for MtHy and MtEcRHy extracts, 3 mg/ml for S2 
and SRS 1.5 extracts, 4 mg/ml for the Kc cell extracts, 
and 1.3 mg/ml for the embryo extract. All manipulations 
were done on duplicate samples in order to quantify 
variability in the results. For immunoprecipitation 
experiments, extracts* were immuno-depleted, mock- 
depleted, or left untreated. For depletions, 300 /il of 
diluted extract was incubated for 30 minutes at 25 °C with 
3.5 nl affinity-purified anti-EcR antibody, or with 
3.5 /il preimmune serum for the mock-depletion control. 
Then 38 /xl 10% Staphylococcus aureus (Pansorbin, 
Calbiochem) in EcR buffer was added, and incubation was 
continued for 15 minutes at 25 °C After centrifugation 
for 3 minutes in a microcentrifuge, the supernatant 
(depleted extract) was recovered. The 

immunoprecipitation was repeated, except in the case of 
the embryo extract, which was subjected to only one round 
of precipitation. The "untreated" extract aliquots were 
left at 4°C for the duration of the depletion procedure, 
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and were diluted with EcR buffer to match the final 
concentration of the depleted aliquots. 

A modification of the hormone-binding assay of P. 
Cherbas was used (Cherbas, P. 1988. Proc. Nat'l Acad. 
Sci. , U,S.A. 85:2096-2100). Assay tubes contained 140 pi 
extract , 14 pi [ 125 I] iodoponasterone, and either 14 nl 
EcR buffer or 14 /il unlabelled 20-OH ecdysone in EcR 
buffer as a competitor. [ 125 I] iodoponasterone was 2177 
Ci/mM and was used at a final concentration of 5xl0" 10 M in 
the assay; 20-OH ecdysone was 2xl0" 5 M final concentration 
in the assay. After incubation for 1 hour at 25 *C, each 
reaction was spotted on a dry Whatman GF/C filter 
(2.4 cm), and after 30 seconds the filter was washed by 
using a vacuum to draw 10 ml EcR buffer through the 
filter over a period of 1 minute. Filters were placed in 
800 /xl 4% SDS, and radioactivity was measured in a 7 
counter. The hormone-binding activities shown are 
saturable binding activities, calculated as the total 
binding activity, as measured in assays with no added 
competitor, minus the unsaturable binding activity, which 
was measured in the assays with excess unlabelled 
ecdysone added. In the most active extracts, the 
unsaturable activity (representing the large number of 
low affinity binding sites in the extract) was less than 
10% of the total activity. 

B. Genetic Regulatory Activity of the EcR Protein 

in vivo 

An ecdysone-inducible reporter plasmid, 
pEcRE/Adh/flgal (Figure 2) , was constructed to test the 
regulatory functions of the EcR protein in vivo . The 
reporter gene in this plasmid consists of the sequence 
that encodes the E. coli B-galactosidase (Bgal ORF) 
linked through the 5 1 leader sequence of the Drosophila 
Ultrabithorax gene (UBX leader and AUG ) to an ecdysone- 
inducible promoter. This promoter was created by fusing 
a truncated version of the proximal promoter for the 
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Drosophila Adh gene (PDAdh-34+53/ the numbers indicating that 
it consists of the s quence from base pair positions -34 
to +53 , which just includes the TATA box) to seven 
repeats of a 34 bp synthetic oligonucleotide (7 EcRE 
5 OLIGOS ) which contains the ecdysone response element 

(EcRE) from the ecdysone-inducible heat shock gene hsp 22 
(Riddihough and Pelham, 1987. EMBO J. 6:3729-3734). The 
seven EcREs should confer ecdysone-inducibility to the 
truncated promoter, provided that the cells transfected 

10 with this reporter plasmid contain the appropriate 

ecdysone receptor. 

This ecdysone-inducible reporter plasmid was 
constructed by insertion of the 7 EcRE OLIGOS into 
plasmid pAdh/Bgal, which is identical to pEcRE/Adh/ flgal 

15 except that it lacks the array of ecdysone response 

elements. The pAdh/Bgal plasmid should therefore not be 
ecdysone inducible and can serve as a control. To test 
these expectations, Sch-2 cultured cells (which were 
shown above to contain endogenous ecdysone-binding 

20 activity) were transfected with each plasmid and examined 

for B-galactosidase activity in the presence and absence 
of ecdysone. The ecdysone-induced 6-galactosidase 
activity in the pEcRE/Adh/Bgal transfected cells was 
2000-fold greater than when such cells were not exposed 

25 to ecdysone, whereas ecdysone had little effect on the 

pAdh/Bgal transfected cells. These results indicate that 
the EcREs confer ecdysone-inducibility on the PnAdh-34+53 
promoter, as expected, and that the Sch-2 cells contain 
functional ecdysone receptors. 

30 To test the function of the EcR receptor in such a 

system, host cells lacking functional ecdysone receptors 
are required. "Ecdysone-resistant" cells lacking 
ecdysone-binding activity, and hence, presumably 
functional receptors, can be produced by continuously 

35 exposing ecdysone-responsive cells to ecdysone during a 

period of several weeks. This ecdysone-resistant state 
is then maintained in ecdysone-free media for several 
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months. An ecdysone-resistant cell line, SRS 1.5, was 
therefore generated by growing Sch-2 cells in 3x10— M- 
ecdysone. The SRS 1.5 cells lack significant ecdysone- 
binding activity. 
5 When these cells were transfected with the 

pEcRE/Adh/Bgal plasmid and subsequently exposed to 
ecdysone, very little ecdysone-induced B-galactosidase 
activity was observed, indicating that the cells have 
only trace amounts, if any, of functional receptors. To 

10 test whether the expression of the EcR gene can "rescue" 

this deficiency, the SRS 1.5 cells were cotransfected 
with two plasmids: the ecdysone-inducible reporter 
plasmid, PEcRE/Adh/Bgal , and a constitutive expression 
plasmid for the EcR gene, pActEcR, in which transcription 

15 of the EcR ORF is .controlled by the Drosonhila Actin 5£ 

promoter, Pacisc (Figure 3) . Cotransfection with these two 
plasmids, followed by exposure to ecdysone, resulted in a 
dramatic induction of B-galactosidase activity. Thus, 
introduction of this EcR expression plasmid into the 

20 SRS 1.5 cells regenerated the ecdysone-inducibility they 

had lost. 



Methods 

Construction o f the pAdh/Baal. pEcRE/Adh/Bcral and pActEcR 
25 plasmids 

Plasmid pAdh/Bgal was constructed in two steps. The 
Bglll-Scal fragment of pDA5»-34, containing nucleotides - 
34 to +53 of the Drosonhila Adh distal promoter, was 

30 cloned into pUC18 cut with Seal and BamHI. The resulting 

plasmid was cut with EcoRl, and the EcoRl fragment of 
cPBbxd6.2 (containing the Ubx untranslated leader and 
AUG, the Bgal open reading frame, and the SV40 splice and 
poly A signals) inserted. 

35 To construct pEcRE/Adh/Bgal from pAdh/Bgal, two 34- 

residue oligonucleotides were synthesized: 

5 1 TCGAGAGACAAGGGTTCAATGCACTTGTCCAATG3 1 

3 ■ CTCTGTTCCCAAGTTACGTGAACAGGTTACAGCT5 ■ 
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These will anneal to form 30 bp duplexes with Sail 
compatible four nucleotid overhangs at their 5' ends, as 
shown. Further annealing via the 5' overhangs allows 
formation of tandem arrays that can be inserted into 
pAdh/Bgal at its Sail site just upstream from the TATA 
box of the truncated Adh promoter. When these 
oligonucleotides were kinased, annealed, ligated into 
Sail-cut pAdh/figal and cloned, pEcRE/Adh/Bgal was 
obtained. Restriction mapping showed that it contained a 
tandem array of seven 34 bp repeats, each of which 
contains the 23 bp ecdysone response element (EcRE) 
present in the hsp 27 gene, the remaining 11 bp 
representing flanking hsp 27 sequences and the 5» 
overhangs . 

The constitutive EcR expression plasmid, pActEcR, 
was formed by inserting the Fspl-Hpal fragment of an EcR 
cDNA containing bp 851-4123 that contains the ORF 
encoding the EcR protein (Table 2), into the EcoRV site 
of the ActSV40BS plasmid. This expression vector was 
constructed in two steps by inserting the Xbal-EcoRl 
fragment of cosPneoB-gal , containing the SV40 splice and 
poly A signals, into BlueScript+KS (Stratagene) cut with 
SacII and Xbal, blunting the EcoRl and SacII ends. The 
resulting plasmid was digested with BamHl and Apal, and 
the BamHl-EcoRl fragment of pPAc was inserted, with the 
Apal and EcoRl ends being blunted. 

Transfection and generation of the cell lir>e SRS 1.5 

The cell line SRS 1.5 was obtained by growing 
Schneider line 2 (Sch-2) cells in the presence of 3X10" 6 M 
20-OH ecdysone (Sigma) . This treatment initially halts 
growth of Sch-2 cells, but after several weeks the 
adapted cells grow well. SRS 1.5 cells were washed in 
hormone-free medium and passed several times in hormone- 
free medium prior to their use in transfection 
experiments. Cells wer transfected by the calcium 
phosphate technique. Cells were transfected with 10 ng 
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of each plasmid used; when only a single plasmid was 

being-transfeotedT- lO-Mg-of-pUGlS-DNA -was-added' as-a 

carrier. In general, all trans fections were carried out 
in duplicate. Twenty-four hours after transfection, 
cells that were to undergo hormone treatment were split 
into two dishes, one of which was treated with 2xl0" 6 M 
20 -OH ecdysone. 

fi-aalactosidase assays 

Forty-eight hours after transfection, 2 ml of cells 
were washed once in PBS (137 mM NaCl, 27 mH KC1, 65 mM 
Na 2 HP0 4 , 15 mH KH 2 P0 4 , pH 6.8), and were resuspended in 
50 /il of 0.25 M sucrose, 10 mH Tris, pH 7.4, 10 mH EDTA, 
and repeatedly frozen in liquid nitrogen and thawed in a 
37 9 C water bath for a total of 3 freeze/ thaw cycles. 
Cell debris was removed by a 10-minute centr if ligation in 
a microcentrifuge at 4'C. The concentration of protein 
in the supernatant (cell extract) was determined by the 
Bradford method, with bovine serum albumin as a standard, 
and was typically 1.5-2.5 mg/ml. Extracts were assayed 
immediately or frozen and assayed up to two weeks later 
with no loss in activity. To 10 /il of extract, or an 
appropriate dilution, 500 pi of assay buffer was added 
(0.6 mH 4-methylumbelliferyl-B-D-galactoside, 60 mH 
Na 2 HP0 4 , 40 mH NaH 2 P0 4 , 10 mH KC1, 1.0 mH HgS0 4/ pH 7.0) . 
After a 30-minute incubation at 37 *C, reactions were 
stopped with 500 /il of 300 mM glycine, 15 mM EDTA, 
pH 11.2. The fluorescent reaction product was quantified 
on a Perkin-Elmer LS-5B luminescence spectrometer, with 
Xar>365 nm and Aem=450 nm. Bgal activities are given as 
fluorescence units per nq protein assayed. 

C. Specific Binding of the EcR Protein to Ecdysone 
Response Elements 

The simplest explanation of the results described in 
the preceding section is that the EcR protein generated 
by the EcR expression plasmid binds to the EcRE of the 
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reporter plasmid and, in combination with ecdysone, 
activates th minimal Adh promoter in that plasmid . The 
following experiment was designed to test whether the EcR 
protein exhibits specific binding to this EcRE in vitro , 
5 Two plasmids were used; pUC18, which serves as the 

control, and pUC18-EcRE, which was generated by 
substituting the Hindll-Xbal fragment from pEcRE/Adh/Bgal 
that contains the seven repeats of the 34 bp EcRE 
oligonucleotide, for the Hindll-Xbal fragment of pUC18. 

10 Because the only difference between these two fragments 

is the seven oligonucleotide repeats, this is also the 
only difference between the two plasmids. 

The two plasmids were digested with ApaLI and Hind 
III, end-labeled with 32 P and mixed with an extract from 

15 MtEcRHy cells in which the EcR protein was overexpressed 

by Cu 2+ induction (see section A, above) . After a 
15-minute incubation at 25 9 C to allow EcR-DNA binding to 
occur, affinity-purified anti-EcR antibody was added. 
The 25 "C incubation was continued for an additional 

20 40 minutes, at which time anti-rabbit Ig-coated magnetic 

beads (Dupont Magnasort-R) were added, and the incubation 
continued 15 minutes more. The beads were separated from 
the solution magnetically, similarly washed, and the DNA 
eluted from the beads in 1% SDS at 65 # C. The eluted DNA 

25 was ethanol precipitated and fractionated by 

electrophoresis in an agarose gel, which was dried and 
autoradiographed . 

Only the fragment containing the EcRE 
oligonucleotide was specifically and efficiently 

30 registered on the autoradiographs , and that registration 

was dependent upon the anti-EcR antibody. Quantitative 
analysis of the autoradiographs demonstrated a 10 3 -fold 
preference for binding to the EcRE oligonucleotide over 
the average vector sequences, under the conditions of 

35 this assay (see Methods, bel w) . 
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According to the criteria stated at th beginning of 
this Experimental Example , "the" EcR protein "clearly 
satisfies the definition of an ecdysone receptor. 

Methods 

Conditions for the DNA binding assay 

A quantity of 0.2 fmole of digested, labelled 
plasmid DNA was mixed with 2 fig (dl/dC) in 10 /il of TE 
(10 mM Tris-HCl, pH 8.0, 1 mM EDTA) , and 90 /il of the 
MtEcRHy extract, diluted to 0.9 mg/ml in EcR buffer 
adjusted to 180 mM KC1, was added. After binding for 
15 minutes at 25 # C, 2 ml of affinity-purified anti-EcR 
antibody, diluted 1.5x in EcR, was added, and this 
incubation was continued at 25 # C for 40 minutes, when 
50 Ml of anti-rabbit Ig-coated magnetic beads (Dupont 
Magnasort-R) , exchanged into 180 mM KC1 EcR buffer, was 
added and the incubation continued for 15 minutes. 

The beads were washed twice in 400 pi 180 riM KC1 EcR 
buffer, and DNA was eluted from the beads by soaking 
twice in 200 Ml 1% SDS in TE at 65 # C. The eluted DNA was 
ethanol precipitated and run on an agarose gel, which was 
dried and autoradiographed. As controls, one half of the 
input DNA (0.1 fmole) was run on the gel for comparison, 
and the binding assay was carried out, leaving out the 
antibody. 

EXAMPLE IV 
RECEPTOR GENE MUTAGENESIS. 

30 Mutations in the steroid receptor superfamily genes 

can alter their function in two ways. Most obviously, 
they alter the sequences encoding the receptor proteins 
and thus alter the receptor function. Alternatively, 
they can alter the expression of these genes — an 

35 alteration that can be at any level of that expression 

from transcription of the gene to the translation of its 
mRNA(s) . Such mutations can change the timing of gene 
expression during development or change the tissue and 
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cell distribution of that expression, thus, profoundly 
changing the course of development. Furthermore, these - 
mutations provide information about the regulation of 
receptor gene expression, just as mutations that alter 
the structure of the receptors encoded by these genes 
provide information about the genes whose expression 
these receptor proteins control. In particular, 
mutations that alter receptor gene expression can lead to 
the identification of the proteins and other regulatory 
molecules that control their expression. Clearly, 
mutagenesis of insect steroid receptor superfamily genes 
provides an important avenue leading to an ability to 
interfere in a highly specific manner with insect 
development, and thus to control insect infestations 
deleterious to human health and agriculture. 

We have carried out mutagenesis experiments for two 
Drosophila members of the steroid receptor superfamily 
genes, E75 and EcR . that we have cloned and characterized 
with respect to their expression. In this experimental 
example, mutagenesis of the E75 gene is described. 

A. Deletion Mutations 

In Drosophila . genetic analysis for a given locus - 
- in this case, the early puff locus at 75B that houses 
the E75 gene — generally depends upon the isolation of 
deletions of all or part of that locus. Such deletions 
greatly facilitate the subsequent isolation of point and 
other small mutations within the locus. By isolating 
mutations that are revertants to the neighboring dominant 
Wrinkled (W) mutations, we have isolated and molecularly 
mapped the boundaries within our chromosomal walk (see 
Experimental Example I) of two deletions, W R4 and w R1 °, 
generated by gamma ray mutagenesis, the preferred way of 
generating such large alterations of genomic structure. 
One of these, W R1 °, extends distally from Wrinkled to 
cover the entire E75 g ne; and the other, W R4 , extends to 
a point about 90 kb upstream of the 5 1 end of the 50 kb 
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E75A transcription unit and does not include the E75 
gene. 

An F2 screen was then employed to screen for gamma 
ray-induced mutations mapping to the 200 kb distal region 
that is included in the W R1 ° deletion but not the W* 4 
deletion. This screen resulted in the isolation of five 
members of a single lethal complementation group that 
molecular mapping data demonstrate represents the E75 
gene. The most useful of these five mutations is the 
E75 X48 mutation. Molecular mapping of this mutation 
demonstrated that it is a 105 kb region that includes all 
of the E75 gene. This method provides an extremely 
efficient screen for other E75 mutations, i.e., by 
screening for mutations that cannot complement this 
deletion mutation. 

B. E75 Mutations Generated by Ethyl Methane 
Sulfonate 

The chemical mutagen ethyl methane sulfonate (EMS) 
was used for this screen, as it is the preferred method 
for generating point or small mutations. An F2 screen of 
15,000 lines resulted in the isolation of 23 penetrant 
mutations within the 105 kb region of the E75 X48 deletion, 
all of which turned out to be alleles of E75 . It appears 
that this 105 kb region was saturated by this screen with 
respect to lethal complementation groups, and hence, E75 
appears to be the only lethal complementation group in 
this region. Adding the five E75 mutations described 
above, a total of 28 penetrant E75 alleles have thus been 
isolated, several of which are temperature-sensitive 
alleles. 

Jnter se complementation studies among these alleles 
and examination of their phenotypes reveal a complex 
complementation group — a complexity that probably 
results from the fact that the E75 gene contains two 
overlapping transcription units, a 50 kb E75A unit and a . 
20 kb E75B unit that occupies the 3» end of the E75A unit 
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(see Experimental Example I and Table 1) . These alleles 
can be roughly divid d~into two groups : those -that — - 

cause lethality in early development, during the latter 
part of embryogenesis or during early larval development, 
and (2) those that cause lethality late in development, 
during the prepupal or pupal stages. 

This division correlates with the stages when the 
E75A and E75B units are expressed. Thus, E75A 
transcription is associated with each of the six pulses 
of ecdysone, including those that mark the embryonic and 
early larval stages. By contrast, E75B mRNAs are not 
observed until the end of the last larval stage, being 
particularly abundant during the pupal stage. This 
correlation invites the speculation that the early lethal 
mutations affect the expression of the E75A unit and its 
E74A protein, and that the late lethal mutations 
specifically affect the expression of the E75B unit and 
its E75B protein. This proposition can be tested by 
detailed molecular mapping of these mutations and further 
examination of their phenotypes at the molecular level to 
determine the causes of lethality. 

The mutants described here provide a foundation for 
the further genetic analysis of the E75 gene that will 
allow exploration of the requirements for appropriate E75 
expression and function and will identify structural and 
functional domains of E75. Some of the future E75 
studies will best be performed by its in vitro 
manipulation, followed by transformation of the 
constructs back into Drosoohila . Finally, it will be 
desirable to identify interacting genetic loci — 
interactions that may occur at the level of regulation of 
E75 expression or at the level of interaction of the E75 
proteins with those encoded by other genes. Such 
interactive genetic loci can be identified via the 
isolation of mutations that act as suppressors or 
enhancers of the E75 mutations. 
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Methods 

Strains 7~marker s arid chr omos omes~ 

For this aspect of the invention, the following 
strains, markers and chromosomes were used. Tu 2 was 
described by Lindsley (Lindsley, 1973. DIS 50:21). All 
other strains and mutations are as described (Lindsley, 
and Grell, 1968. Genetic Variation of Drosophila 
m&lanocraster , Publication 627, Carnegie Institute of 
Washington, Washington, DC) . ru h W* 4 e s ro ca was 
constructed by recombination between £u h W R4 sbd 2 Tu 2 and 
£t sbd 2 e s ro ca. The st in ri e p sbd 2 chromosome was 
constructed by recombination of st in ri p p with sbd 2 , in 
order to allow marking of this chromosome over W R4 and 

RIO 

ST 1V , and homozygosed by crossing to TM3, backcrossing to 
TM3, and mating of isogeneic sibling progeny. The st e p 
ell line was homozygosed by standard ionic procedures. 
Anto w and ns Rc4 are described in Scott et al. (1984) 
Proc. Nat 'l Acad. Sci. USA 81:4115-4119. The pupal 
lethals X19, g26, 013B . 8ml2 . iX-14 . 2612 . m45, pir fllOL, 
mz4l6, I3mll5 . 052 and wa49 are described in Sheam 
(1974) Genetics 77:115-125. All strains used to 
construct the strains described above and other strains 
were obtained from the Bowling Green and Caltech stock 
centers . 

TM1, TM3 and TM6B (Lindsley, and Grell, 1968. 
Genetic Variation of Drosophila melanoaaster r Publication 
627, Carnegie Institute of Washington, Washington, DC) 
are balancer chromosomes carrying recessive lethal 
mutations along with multiple inversions to suppress 
recombination. This allows the maintenance, as a 
heterozygote, of a recessive lethal chromosome in its 
original state. These chromosomes are also marked with 
convenient visible markers. 

Quantitative Southern blot mapping for detection of 
mutant lesions 
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DNA was prepared from adult flies (about 50) by 
~dounc^^ ~60~mM-NaClT 
10 mM EDTA, 0.15 mM spermine, 0.2 mg/ml proteinase K. 
The homogenate was added to an equal volume of 0.2 M 
Tris-HCl, pH 9.0 f 30 mM EDTA, 2% SDS, 0.2 mg/ml 
proteinase K, incubated at 37 9 C for 1 hour, and then 
extracted twice with buffer-saturated phenol and once 
with 24:1 chloroform/isoamyl alcohol. DNA was EtOH 
precipitated twice, hooking the pellet out without 
centrifugation. Southern blot hybridization was as 
described (Segraves, W. et al. # 1984. J. Mol. Biol. 
175:1-17). Where restriction fragment length 
polymorphism was not used in order to distinguish the 
parental chromosome from the balancer chromosome, 
quantitation of band intensity on genomic Southerns was 
achieved using a scanning densitometer. By using a 
control probe outside the mutant region, the amount of 
DNA in each track was internally controlled. Comparison 
of deficiency heterozygote to wild type bands, when 
normalized to a control band in this way, gives little 
deviation from the expected 1:2 ratio. 

Molecular cloning of mutant lesions 

Restriction fragments of the appropriate size were 
isolated by preparative low melting agarose (FMC) 
electrophoresis of about 20 fig of restricted genomic DNA. 
The 6 kb E R4 Xhol fragment was cloned into Xhol-cleaved 
ASE6DBam which is propagated as a plasmid in order to 
grow the vector and cannot be packaged without an insert. 
The 18 kb W R1 ° Sail fragment was cloned into the Sail site 
of AEMBL3, cleaved also with EcoRI for the biochemical 
selection method for the prevention of propagation of 
non-recombinant clones. The 7 kb EcoRI fragment 
containing the x37 breakpoint was cloned into EcoRI - 
cleaved A607. Plating of recombinants on the hf 1A strain 
RY1073 prevent d plaque formation by non-recombinant 
phage. The 14 kb x48 EcoRI fragment was cloned into the 
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EcoRI site of AEMBL4, which had been cleaved with BamHI 
to utiTize~the "biochemical—seTection^'-f or ^recombinants^" 
The breakpoint fragments of x44 and the recipient 
fragment were cloned into ASE6ABam. Libraries were 
packaged using A in vitro packaging extracts prepared as 
described in Hohn (Hohn, B. , 1979. Methods Enzvmol. 
68:299-303). After demonstration that each of the 
libraries gave a significant number of plaques only when 
inserts were included in the ligation, they were screened 
using restriction fragments capable of detecting the 
breakpoint clones. 

Gamma rav mutagenesis 

Adult males of the strain ruhW sbd 2 Tu 2 or st in ri 
E p sbd 2 were irradiated in plastic vials with 5000 rad of 
gamma rays from a Cs 137 source at a dose rate of 4300 
rad/minute. These were then mated to virgins of the 
appropriate strain, which were allowed to lay eggs for 
five days. 

EMS mutagenesis 

The primary lesion in EMS-induced mutations of 
bacteria and yeast is an alkylation- induced transition of 
guanine to adenine; most EMS-induced point mutations in 
Drosophila can similarly be explained on this basis. 
This change would be expected to convert, on the 
complementary strand, a C in the opa repeat element to a 
T, creating an in-frame stop codon (CAGCAA to UAGCAA or 
CAGUAA) . (Ethylnitrosourea, ENU, which has been reported 
to yield a higher number of mutations for a given amount 
of sterility, is also an alkylator; however, considerably 
more stringent precautions must be taken in handling this 
mutagen. ) 

EMS was administered at 0.025 M to unstarved 1.5-5 
day-old males in 1% sucrose solution (1.5 ml on two slips 
of Whatman #2 in a 350 ml milk bottle) . Starvation of 
the males for 8 hours before EMS administration resulted 
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in unacceptable levels of sterility, and males of the st 

^ p ^l^tra 

without starvation. Mutagenesis was monitored by 
crossing mutagenized males to attached-X FMA3 females. 
Other mutants seen in this screen included a large number 
of ca alleles (many mosaic) seen over TM6B in the Fl and 
F2 generations, a dominant brown allele, and two new 
mutants, Wink , a third chromosome dominant mutation 
resembling Bar , and a third chromosome dominant Curly - 
like mutation. Wink is easily scored (RK1) , has complete 
penetrance, and is quite healthy over TM6B. 

In the initial screen, vials were scored as mutant 
if they had fewer than 25% as many deficiency 
heterozygote as balancer heterozygote flies. On 
retesting, this was revised to 50% of the level seen in 
control crosses. Balancer heterozygotes were 
approximately two-thirds as viable as deficiency 
heterozygotes . 

In situ hybridizat ion and cvtoloaical analysis 

In situ hybridization of polytene chromosomes was 
carried out as described in Experimental Example I (see 
Methods , section A) . Cytological analysis was performed 
by squashing larval salivary glands in lactoacetic orcein 
(2% orcein, 50% acetic acid, 30% lactic acid) . 

Although the present invention has been described in 
some detail by way of illustration and example for 
purposes of clarity of understanding, it will be obvious 
that certain changes and modifications may be practiced 
within the scope of the claims. 
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erAMTtDPT. .lncroa 2. JULUUUAO 

TCC OCT TTC CAT TAC OK 6T0 CAT TCC TOC GM GST TGC AM GOA TTC TTC GOC 
Sar Oly rho Mia Tyr Gly Val All rer Cyi Glu Gly Cya lya Gly rre n» Art 

m 

CKTKAKCMCMAMAKCMmCKCKTKMCMOMTCMCMTK 

Are rer Ila Gla Gla Lya Ila oia Tyr Are rro cya Thr Lya Am oia gu cya 

AOC ATT CTO CCC ATC AAT COC AAT COT TOT CAA TAT TOC CCC CTG AAA AM TOC 
rer ire Lm Axt Xw Are Are Are Are Cya Ola Tyr Cya Art Lre Lya Lya Cya 

UlUAJLAU.l..latrotl 3..CCAATTOCM 



ATT OOC CTO GOC ATO AST COC GAT OCT OTO COT TTT GOA COC OTO COO AM COC 
Ila Aia val Gly mt Aar Are Aap Ala V«l Art rre cly Art Val rre Lya Art 

11S2 114 

CM AM OCB COT ATC CTO GOB CCC ATS CM CM MC ACC CM MT CCC GOC CM 
Cly Ly» All Act Ila Lm Ala Ala MET Gla da Sar Thr Gla Are Art Gly Gin 

CMCCAGCCCTCGCCACCCMCTOGATCMCMCCACK 

Gla Art Ala Lau Ala Thr Glu Lm Aap Are Gla rre Art reti Lm Ala Ala val 

CTO COC CCC CM CTC CM MC TOT GM TTC ACC AM GAG AM GTC TOO GOB ATO 
Lm Art AU Are Lm Glu Thr cya Glu rre Thr Lya Glu Lya val sar Ala MT 

III 

gpAGTCTOU.lfCrea «.Ja UUjLAC 
COS CM COB CCS COB CAT TOT ttt w wt» »w: w« mm mm «m> w* 



AM TTO CTG CM MB ATO CCC GAT CTG COC ACC CTO AOC ACC CTO CM AOC CmO 

LyiiajUBaimnrnreiMicinrlajawluiaiiuaifiiB 

AM CTG OTA GTT TTC CSC AOC GM CM AM GM CTO CTO COC CM CM ATO TOO 
Lya Lm Val VAX Tha Art tAr Glu Ala Lya Gla Lm Lm Art UU Gla fCT Try 

3W 

2112 

TCC ATG GM GM GOC AM AM MC GAT GOC CM CM AM AM TOO COC TOG CCC 
sar IBS Gla Are Gly Am am lor Aap Gly Ola Gla Am Ayr sar rro rer cif 

AOC TOO GOO GAT GOC ATO CM CTO CM OCB QBC AM AST OCO CTT OOC TCC GTA 
Sar Trp Ala Aap Ala MET Are Val Gla Ala Ala Lya Sar rro Lm Gly sar val 

TCB MC ACT GM TCC CCC GM CTC GAC TAC OOC ACT OCO AOC MT TOO CM CCA 
lor Sar the Glu sar Ala Aap Lm Am tyr Gly sar rro Sar rer sar Gla rro 



rro rra Gla Gla Gla rro Mr Ala Lau AU Aar 



Ola Gly Val tor Lm ffs 

too Oct cer cro cto oco occ mo crc tcc oca oca tot ccc crs ok am coo 

Sar Ala rre Lm Lm Ala Ala Thr lm sar cly Gly cya ire Lau Art Are Ait 
2132 

OK AAT TCC OK TCC MC GST GM TCC CCA OCA OCT GM ATO GAT ATC 0TT OK 
AU Are sar Gly sar sar Oly Aap sar Gly Ala Ala Gla MT Aap Xla Val Gly 

702 

TCB CM GCA CAT CTC MC CM AM GGS CTG MA ATC ACS CCS ATT GTS GBA CM 
Sar Ala Ala Mia Lm Tbr Gin Are Gly Lm Thr Ila TAr Pro Xla val Art *ia 

C lMTATCTT..la tro a S..TTTCTTACM 

CM CM CM CM CM CM CM CM CM ATC GOA ATA CTC AAT MT OCB CAT TCC 
Gin Gla are GU Gla Gla Gla Gin Gla IU Gly IU Lm am am Ala Are Sar 

OK AM TTS AAT GOB GGA CM GOB ATO TK CM CM CM CM CM CM CCA CM 
Art Am uw am Gly Gly Ala AU MT Cya OU GU GU GU GU Mia rro Gla 

g arenas) 

CTO CM CM CM TTC ACA GCC GCA OCT OCC CK TK ASA AM OA CAT TOO OR 
Lm AU nil HU Lm thr AU Gly Ala AU Art Tyr Art lya Uu Aap sar rro 

Art 

2702 

AGS GAT TCB GK ATT GM TOO OK AAC CM AM AM GM TK AM GEO CTG ACT 
Thr Aap Sar Gly IU Glu Sar Gly Are Glu Lya Am Glu Cyt Lya Ala Val Sar 

TCB GGS GGA AST TCC TCB TK TK ACT CCC COT TCC AST OTO GAT GAT OCQ CIO 
Sar Gly Gly Sar Sar Sar Cya Sar sar rro Art Sot sar val Are Are Ala Uu 

110 

GM TGC MC GAT GK OK GCC AAT CM AAT CM OTO GTS CM OT CCC CM CTG 
Aap Cya Mr Aap AU Ala Ala Are Mia Are GU Val Val Gla Mia rre Gla Uu 

MT CTG GTS TK GTS TCA CCA GTT CK TCB CK CM CCC TK MC MC MC CAT 
sar val Val sar val Sar rro Val Are sar rre GU rro Sar Thr Sac rer Ala ^ 

CIS AM CCA CM ATT OTO GM GAT ATC CK GTC CTO AM CK CTG CTC CM KT 
Lm Lya Are GU Ila Val GU Aap MET rro val Lm Lya Art Val uu Gin Ala 

•M 

2172 

CK CCT CTG TK CAT AK AM TCB CTG ATO GM GM GCC TK AM CCO CM AM * 
rre rro Lou Tyr Are thr Are Sar Lm HIT Aap Glu AU Tyr Lya rro Hla Lya 

AM TTC COS GK CTC CCC CAT CK GM TTC CM MC OK GM CCC GAT KC AOC 
Lya rha Art Ala Lm Art MU Art Olu trm Glu thr Ala Glu Ala Are Ala Sar 

AST TK ACT TK CK TCC AM AK CTO ACT OK GK AST CCS CK CM MC CCA 
Sar Sar thr sar Gly Sar Are ter Lau sar AU Gly Sac Ire Are Gla sar rro 

Ul 

CTC CCC AM MT GIG GK ACS CK KC CCA TOO OCO GK AK OK OK OCA GXZC 
Val rro Aaa Sar val Ala Thr rro rro rre sar AU Ala tor AU Ala AU Cly 

MT CCC CCC CM MC CM CTO CM ATO CM CTO MC CK MC MC CK AM GK 
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B 



mi m Ail m *m mi mi vii Ait lyt sn in nr Ala ciu fro *tq mmmmmm»w- 



A!QMBOBeMG»ISaM0CCUCGtfMTATCCMMCTKnQM6CaC 
ICT At f(« Cia OA «T Lyi Alf SM Alp II* Il« CiA Am fyz Ly. AT* 

a« AM Mr 1kt Ala Alt Stt lit Thr nr Am Cly val Gly am Ai« lv>n 

AOC aoC AOC ICC ACA OCS CCC CCC TCC GCC CTC 00 AA7 CAC CM OCT tOB OK 
M m hi Mr fttfronornJAtAUmciBAiafilftGlAAn Ttp Cly 

103C 

j^JTCTCOOTCATCACCACCACCTOCCMCMCeCCMC^ 

wr Mr Mr vil Ut nr m Thr Cyi OlA SiA Ar« filfl Cln Aft V«i Mc fro 

1312 

CM MC AM OBT tOC MC TOC ACT TCC ACC TCT AOC TCC AOC TCC ACT TOB TCA 

im m A« oiy m am «r ««r «« Atr im far ui uc »« i« 

TOC TCC TOC ASA TCC TOC AM TOC AOC TOC AOC TOB ODE AOC AOC TOC CAC TAT 
*tmz** to *t *m* cyn to •** imz to AX* M~ 9m* cr» <^ ^ 

Vm Cla Mr fro Al. Air Thr Air Am Cly Thr to All fro All to Mr to 

TOO OBA TOB AAC AOC OOC AOS OOC CTO CTO GAA CIO CM OTO CMS ATT OCT CAC 
Mr Gly Mr AM to AH mr fro Uu Lbu> Ola Uu Ola vil am 1M ah am 

TOB 000 CAS OCT CTC AAT TT6 TCC AM AAA TOB OOC AOB OCB COB OOC AOC AAS 
Mr Al« filA rn Am aia Im Au iyt Mfi im in fht fro rn fro Mr Lya^ 

37t2 

CTO CAC OCT CTO OTO OOC OOC OS AAT OOC OTT CAA AOB TAT OOC ACA TTB TOC 

lh ui All im «ai aia au aia am All va Oil An ryr f n nr IM Air 

3CC CAC OTC ACA OTO ACA OOC TOC AAT OOC OOB TOC TOC CTC OOC OOC OOC CAS 
Ala am Vil Thr Vil Thr Ala Mr am Cly Cly Mr Mr val Oly Cly Cly Glu 



Mr Gly am Ola em Ola Mr Ala Cly Glu Cya Oly Uu fro Ola Mr Cly f ro^ 



aBJC^AflPBJCJfi 



f ffy- t r^M^n^wyiM mi^iii. iiaa a^CTTflOCTOOST^ 



CTCTOBCTOB TJOffltttTOCTOOTO BAflBS WO 1 reATTCOCAMTTOCTCOTCflt 1U 

I a» MM acart »«• 



lAXOCTACAOCAAOC 



CAAAATCTCTAATTCAAAT ATO CTT TOT OCA ATS CAA 

mr val Cya Ala *T cm 



GAS CTT OCT OCT OTO CAS CAT CAS CAS CAS CAA CAS CAA CTC CAS TTB OCC CAfl 
Glu Val Ala Ala Val OU HU Ola Gla Cla 61a Cla Cla Lav Cla Imu fro 61a 

CMCAACMCBflCASCJBCASACAACACJeC 
Gla ciii Cla Ola cia Cla Cla rar thr Cla Cla Gla ma Ala thr 



24 

AOB ATA CTC 
thr Ila val 



CTO CTO AOB OOC AAT OOC OOC OCT AAT CIS CAC ATT CTC 006 ACA OCB CAA CAC 
Lau Lmh thr Qly Am Oly Clf Cly am 1m Ala 11a v*i All thr fro Gla Cla 

CAT CAS COB ATO CAT CAS CTC CAC CAT CAS CAT CAS CAT CAB CAT CAS CAC CAC 
Mia Gla fro NCT Mia Ola Inn Ala Ala Cla Ala Cla AUa cla Mis da an. Gla 

CAS CAS OCC AAS AOC CAA CAS CTC AAS CAA CAA CAC TCO GOB CTC CTC AAS TTO 
cla Gla All l>yi Mr Cla Cla 1mm Ly. Cla Gla aia Mr Ala Imo vil Lya Lau 

S Sly £2 Ala fro lla Ly. Gla Gla Gla Cla Thr f» Lyi Gla Urn wn Tyr 



GAO OOC GDC GET OCA CAA OCT AAT OCT OCA OOC CTDA ACA OCS OCA OA COT AOB CTO CAS CAS CAS CAS CAS CAA COB CAA COC AAA ACA CTO AAA AAC GAA *EA ac* 

Su am Arc am All Gla Gly am Ail cly Cly val An Ala Oly Oly Gly Am Lao Cla Cla Gla Cla Gla Cla fro Cla Am Lya An La» Lya Aaa Cla au am 

Ate OTA CAA CAS CAA CAA CAA ACA OCT OCA ACA CTA OTA AM ACA ACA ACC AOC 

*5™ciA™CLaCMcStmrf»AMT^ 

ACC AAC ACC AAC AOC AAC AAC ACC CM ACA ACA AAT AST ATT ACT CM CAS CAA 

Mr am Mr Am Mr am am Thr Gla Thr Thr Am Mr Ila Mr Ola Gli cla 



TOO TTT TM OOB QM AM TOC CM ACA CM MA CTO OBA CTO OCA CTT CM OBA 
Tip PAt Tyr All Glu Lya Tip Ola am Cla Am Imo Cly val Ala Val Cla Arc 

4053 

AOC ASS AM CM GAT CM TTO CM CBS COB CM TTB AAT TAA 
Mr am Lyi Gla An Ala Lou civ An An Civ Law am . 

1231 



43M 



TTTAAAAAACAAAXAAACAAACAAM 

mjnvniw rr~*" w - m * m ** mr * 9 nn mmnifiiii arir ■ ■ >■ aatat* aitBTiT rm 

STAAACTTTAAflBTMCAMAAACCAACI 



4114 



CAACOCTTCCTCCMACACCAMCCTCTTT 



CAG CM CAT CM ATT CTO TTB CM CM CM CM CCA GCC OCB OCA OCA ACA OCA 
Gla Cla AU. GLa ILa vil bw Cla ana cla Gla fro Ala Ala Ala al. thr fro 

AM CCA T6T OOC GAT CTO AOC OCC AAA AAT CM AOC CM TCO CSC AIC CM GAG 
Lya fro Cya Ala Asp Lao Mr Ala Lyr Aaa An Mr Gla Mr Cly XI. Arp Glu 

GM TCC COC AM AOC CAT GM GAT TCC CCC AAT GCC AM COS OOC OOC ACA TCO 

A«p cy. tro Am Mr am Glu Am Cyi fro Am All am fro Ala cly Thr Mr 

CTC GM GM ACC MC TM GM CM TAT CM TOC CCC TOB AM AM ATA CCC TAT 

25 2u a* to to Tyr Clu Cla Tyr cla cya fro tip Lya Lya Ila Arc Tyr 



ASM 

TTAOCOCTTTMTTC 



aTAMTTTTAAflCTCTrOT 1U I 1 CATATTA A1 fAOBTI 1A TTOGTAACTATCT 
GMCAAAACT M111ULUI11U1A1UU111IJA AM 



CCOCCTGMCTCCTCJuVICMCOCCMTTBCMCMCMCMA^ 

Ala Arc Su Lau uZ Lya Cla Aro Clu Lau Clu Cla Gla Ola Thr Thr Gly 



Cly 



I » poly A alt* oTM1t3T inA c0a*S3t 
ATTCATTTTTTATGA 



47*4 

ACCCOTATTCCfOAtTT 



tCAAMCTCTTCTCTATa 



nCTGTACATMTTAAATCTTAMTTAAACACAAAA M 



TTMTGAAGAAAtli 0AT0CTTAAAACATTCMTC1 1 ICM fOOCATCTTAl 



5114 

AACAAAACACAAAACITQCOOTMCAlCAi 



HAAAHTAAAIAAfJUIT AAAAAAT 



lixiM iimmI for lona ttiMcripci 



MC AM GOB CM CM CM CTC CM GOB AM CCA OCT OCA ATA CCC ACC MC AM 

to AM Ala cS cS Cla Vil Glu All Lya fro Ala Ala Xlo fro thr Mr am 

AT? AM CM CTC CM TOT GAT ACT CCC TTT TCO GOB CM MC CM AM CAA ATC 
Ila Lya Cla Lou £» Cya An Mr fro f M Mr All Gla thr Ila Lya Clu II. 

em ur ere era eoe CM CM tec cm caa caa cas erf gto goc acg cm cm 
All Am Lm Imu Arc Ola Cla Mr Gla Gla Gla Gla v.1 Val Al. Thr Gla Gla 

CM CM CM CM CM CM CM CM CM CM CM CAA CAA OBA MC CAT MC TCC 
Ola Cla cia GU Cla Gla Gla Ala Gla Kla Gla Cla Gla An Aro An Mr Mr 

CAC AGC AM rOZ TCO CTC ATG MC AM TOC ACC AM TCC AST OCC OSC AAT TCT 
S££ 522SurA«Au W A«*,r*irAl.(U,^ S 

.„ (XT' tae CAC CM CM CTO GM GM ATO GM GM GCC CM 

HFrrr err trr- tar GIT GAA CTT TCC CM CM CAT CM CM CSA CTC CM TCC 
Xjp Mr Gl^ Cy^ Arp Aap Clu 2u Cya Clu Gla Hi. Kla Gla Ara Imu a. P to 

TOC CAA CTC MT TM CTO TOC CM AM TTC GAT GM AAA CTC CM MG GCC CTC 
Mr Gla Imu An tyr Lou Cya Cla Lya fha An Clu Lya Lau An Thr Ala lju 

l«CAMMCMCOCCAMACOOOGAa3AMA«CCACCT 

Mr am Mr Mr All am thr Gly Arc am Thr fro All v * 1 Thr u * Citf 

144* 

CAT OCC CAT OtaoOCttlC 
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A 

ACTTACTAGTGAAAAACATX^TAATAJ^C^ 



CTTATC7TTAAAAAAAI*GGTGAGATTGTAAC^ 

ACTc^TTTgn^a«cam7rjg 



TX^TQGA1X»^G»C^TGTTa^AG TQ^ 



^ ► mRNA starr site 



^TTAAGCAGTGTAGTGAGTOCCCOGTGAAAAGGATAAGCCAAAAAGTGATT^ 

+ 211 

^TTATCAGTGTCAAGAAAAC^IETAAACTTGGCTCAAAAAGGGCTTTAA 



GAAGGATAAAATAATATtXZaCGAGTXgnxaAAM 



AAATTTTTTTAATAAATATTCCAAAAATATTGCOCCTGAAAAGTCnTGATAAA 
380 

ATG TTA ATG TOC GCG 
MET Leu MET Sor Ala 



ACG GTC AGT GCC AQC 
Thr Vai Sor Ala Sor 



gca cca car att ttg 

Aia Pro Pre I1q Lou 
542 

CTA CCG CCA AAC ACA 
Leu Pro Pro Asn Thr 



ATG GGC CAG AAT TAC 
MET Gly Gin Asn Tyr 



GAC AGT TCA GAT AGC GCC AAG ACT TCT GTG ATC TGC AGC 
Asp Ser Sor Asp Ser Ala Lys Thr Sor Vai lie Cys Sor 



ATG CTA GCA CCA CCA GCT CCA GAA CAG CCC AGC ACC ACA 
MET Lou Ala Pro Pro Ala Pro Glu Gin Pro Sor Thr Thr 



GGG GTA ACA GGT CGA TCT CAC CTG GAA AAT GCC CTG AAA 
Gly Vai Thr Gly Arg Sor His Lou Giu Asn Ala Lou Lys 

54 

AGT GTT TCG GCT TAC TAC CAG CAC AAC AGC AAG CTG GGC 
Sex Vai Ser Ala Tyr Tyr Gin His Asn Ser Lys Lou Gly 



AAT COG GAA TTC AGG AGC CTG GTA GCA CCT GTC ACA GAT 
Asn Pro Glu Phe Arg Ser Leu Vai Ala Pro Vai Thr Asp 



CTG GAT ACT GTG CCA CCC ACA GGT GTG ACC ATG GCG AGT TCT TCG AAT TCT CCC 
Leu Asp Thr Vai Pro Pro Thr Gly Vai Thr MET Ala Sor Sor Sor Asn Ser Pro 

108 

AAC TCC TCC GTC AAG CTG COC CAC AGC GGC GTG ATC TTT GTC AGC AAA TCG AGT 
Asn Sor Ser Vai Lys Leu Pro His Ser Gly Vai lie Phe Vai Sor Lys Ser Sor 



GTC AGC ACC ACC GAT GGT CCC ACT GCA GTG TTG GAA CAG CAG CAG CCG CAG 
Ala Vai Sor Thr Thr Asp Giy Pro Thr Ala Vai Lou Gin Gin Gin Gin Pro Gin 

812 

CAG CAA ATG CCC CAG CAC TTC GAG TCC CTG CCC CAC CAC CAC CCC CAG CAG GAA 
Gin Gin MET Pro Gin His Phe Glu Ser Lou Pro His His His Pro Gin Gin Glu 

162 
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812 

CAG CAA ATG CCC CAG CAC TTC GAG TCC CTG CCC CAC CAC CAC CCC CAG GAG GAA 
Gin Gin MET Pro Gin His Pho Giu Sor Lou Pro His His His Pro Gin Gin Glu 

162 

CAC CAG CCA CAG CAG CAG CAG CAA CAA CAT CAC CTT CAG CAC CAC CCA CAT CCA 

Hrs - Gto ~p. ro - G1 - n ^ 



CAT GTG ATG TAT COG CAC GGA TAT CAG CAG GCC AAT CTG CAC CAC ICG GGT GGT 
His Val MET Tyr Pro His Giy Tyr Gin Gin Ala Asn Lou His His Sor Gly Gly 

ATT GCT GTG GTT CCC GCG GAT TCG CGT CCC CAG ACT CCC GAG TAC ATC AAG TCC 
Ilo Ala Val Val Pro Ala Asp Ser Arq Pro Gin Thr Pro Glu Tyr lie Lys Sor 

216 

TAC CCA GTT ATG GAT ACA ACT GTG GCT AGT TCG GTA AAG GGG GAA CCA GAA CTC 
Tyr Pro Val MET Asp Thr Thr Val Ala Sor Sor Val Lys Gly Glu Pro Glu Lou 



QTGAGTTCTG. . int ron 1 . .TTCTTTGCfrg 



1082 

AAC ATA GAA TTC GAT GGC ACC ACA GTG CTG TGC COC GTT TGC GGG GAT AAG CCC 
Asn I la Glu Pho Asp Gly Thr Thr Val Lou Cys Any Val Cys Gly Asp Lys Ala 



GTAAGTTCGT. . int ron 2. •ATCGTTT 




TCC GGT TTC CAT TAC GGC GTG CAT TCC TGC GAG GGT TGC AAG GGA TTC TTC OGC 



Sor Gly Pho His Tyr Gly Val His Sor Cys Giu Giy Cys Lys Giy Pho Pho Arq 

COC TCC ATC CAG CAA AAG ATC CAG TAT CGC CCG TGC ACC AAG AAT CAG CAG TGC 
Arq Sor Ilo Gin Gin Lys Ilo Gin Tyr Arq Pro Cys Thr Lys Asn Gin Gin Cys 



AGC ATT CTG COC ATC AAT COC AAT CGT TGT CAA TAT TGC COC CTG AAA AAG TGC 
Sor Ilo Lou Arq I!q Asn Arq Asn Arq Cys Gin Tyr Cys Arq Lou Lys Lys Cys 

CTGAGTACCT . • int ron 3 . . CCAATTGCAG 



ATT GCC GTG GGC ATG AGT CGC GAT GCT GTG CGT TTT GGA CGC GTG CCG AAG CGC 
Ilo Ala Val Giy MET Sor Arq Asp Ala Val Axq Pho Gly Arq Val Pro Lys Arq 

324 

1352 

CAA AAG GOG CGT ATC CTG GOG GCC ATG CAA CAG AGC ACC CAG AAT CGC GGC CAG 
Glu Lys Ala Arq Ilo Lou Ala Ala MET Gin Gin Sor Thr Gin Asn Arq Gly Gin 



CAG CGA GCC CTC GCC ACC GAG CTG GAT GAC CAG CCA CGC CTC CTC GCC GCC GTG 
'Un Arq Ala Lou Ala Thr Glu Lou Asp Asp Gin Pro Arq Lou Lou Ala Ala Val 

CTG CGC GCC CAC CTC GAG ACC TGT GAG TTC ACC AAG GAG AAG GTC TCG GCG ATG 
Lou Arq Ala His Lou Glu Thr Cys Giu Pho Thr Lys Glu Lys Val Sor Ala MET 

3 7 0 



gTAAGTCTCA. . int ron 4 . . ATTTCT 




COG CAG COG GOG COG GAT TQC CCC TCC TAC TCC ATG CCC ACA CTT CTG GCC TGT 
Arg Gin Arg Ala Arg Asp Cys Pro Sor Tyr Sor MET Pro Thr Lou Lou Ala cys 
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CCC CTG ht*S v.*C GCC CCT GAA" CTG CM TCG GAG CaG fa.': - 

Pro Lou Asn Pro Ala Pro Glu Leu Gin sir £ Pno * r SJ £ 

1622 

^^C^^GTA.ATT^Cac. GOC^CTG ATC GAC T Grr r-r- ^ » 

U. «, V.1 H. Arq «, ^-g ^ K g? i g £ gg^gg 

432 

CTG CTC ACC CAG GAC GAT AAG TTC ACG CTC CTG AAG GCG GGA C ~~ 

Leu Thr Gin Asp Asp Ly 3 Pho Thx Leu Lou Lys AU G^y Lou Pno ^ 

CTG TTT GTG CGC CTG ATC TGC ATG TTT GAG TCG TCG ATA AAC -tjl a-y- 

Leu Pho Vai An, Leu lie C ys MET Pho Asp Ser IS £1 JUS S iu S SI 

CTA AAT GGC CAG GTG ATG CGA COG GAT GOG ATC CAG AAC GGA Gcr ut rr**» »~— 
-u Asn Giy Gin Val MET Arg Arg Asp Ala Ilo Gin Asn Giy Ala Asn Ala Arq 

486 

TTC CTG GTG GAC TCC ACC TTC AAT TTC GCG GAG CGC ATG AAC arc w 

Ph ° ^ Val A »P S - «* ^ Asn Ph. Ala Glu S K i£ S MET & S 
1892 

-Si ?S X 8 ^ CTO T7C « ATC GTT CTG ATT ACG CCG GAT CGC 

.hr Asp Ala Glu Ilo Gly Lou Pho Cys Ala Ilo Val Leu Ilo Thr fro Aso Arg 

CCC GGT TTG CGC AAC CTG GAG CTG ATC GAG AAG ATG TAC T^G cga rrr* 
Pro Gly Leu Arg Asn Leu Glu Leu lie Glu Lys MET Tyr Sor Arq Leu Lys Gly 

54 0 

TCC CTG CAG TAC ATT GTC GCC CAG AAT AGG CCC GAT CAG CCC GAG T*** CTG rr— 
Cys Leu Gin Tyr He Val Ala Gin Asn Arg Pro Asp S £u S E 

AAG TTG CTG GAG ACG ATG CCC GAT CTG CGC ACC CTG AGC AC*" T* r&r- 

Lys Leu Leu Glu Thr MET Pro Asp Lou Arg Thr Leu Sor Thr Lou His Thr Glu 

AAA CTG GTA GTT TTC CGC ACC GAG CAC AAG GAG CTG CTG CEf* ra<- m~ 

Lys Lou Val Val Pho Arg Thr Glu His Lys Glu Leu Leu Arg Gin Gin MET Trp 

2162 594 

sor MET Glu Asp Gly Asn Asn Sor Asp Gly Gin Gin Asn Lys Sor Pro Ser cTJ 

A ^ CAC CTG GAG GCG GCC AAG AGT CCG CT~ CTjt m a 

Ser Trp Ala Asp Ala MET Asp Val Glu Ala Ala Lys Sor Pro Leu Gly Sor vai 

T^ 5 AQ: " ACT TCC CTG GAC TAC GGC AGT CCG AGC AGT TC ear 

Sor S.r Thr Glu Sor Ala Asp Leu Asp Tyr Gly Sor Pro Ser Sor Ser Sn S 

648 

e^ISSI?nf ro ^ ra ^ OT ^^^^^c<:TCTCGccAa 

Gin Gly Val Ser Leu Pro Ser Pro Pro Gin Gin Gin Pro Ser Ala Leu Ala S« 

TCG GCT CCT CTG CTG GCG GCC ACC CTC TCC GGA GGA TGT CC~ CT r*— &f 
Ser Ala Pro Leu Leu Ala Ala Thr Leu Ser Giy Gly Cys Pro Leu Arg Asn Arg 
2432 

a?« A ^ CCA GCT GAG ATG GAT A'C GTT CCC 

Ala Asn S r Gly Ser Ser Gly Asp Ser Gly Ala Ala Glu MET Asp if SI Sy" 

702 
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2432 

GCC AAT TCC GGC TCC AGC GGT GAC TCC GGA CCA GCT GAG ATG GAT ATC GTT CCC 
Ala Asn Sar Gly Ser Ser Gly Asp Ser Gly Ala Ala Glu MET Asp He Val Gly 

702 

-TCG-CAC- GCA-CAT-CTC-ACC-CAG-AAC-GGS-CTG_ACA-ATC_ACG_CC3_ATT^ 

Ser His Ala His Leu Thr Gin Asn Gly Leu Tiir II Thr Pro He Val Arg His 

G JAGTATCTT. . lr .t r on 5 . ♦ TTTCTTAC >G 

CAS CAG CAG CAA CAA CAG CAG CAG CAG ATC GGA ATA CTC AAT AAT GCG CAT TCC 
Gin Gin Gin Gin Gin Gin Gin Gin Gin lie Gly lie Leu Asn Asn Ala His Ser 



CGC AAC 
Arg Asn 



cro cac 

Leu His 

2702 
ACG GAT 
Thr Asp 



TTC AAT GGG GGA CAC GCG ATC TGC CAG CAA CAG CAG CAG CAC CCA CAA 
Leu Asn Gly Gly His Ala MET Cys Gin Gin Gin Gin Gin His Pro Gin 

756 

G(Dro4 925) 

CAC CAC TTC ACA GCC GGA GCT GCC COC TAC PC* AAG CTA GAT TCC CCC 
His His Leu Thr Ala Gly Ala Ala Arg Tyr Arg Lys Leu Asp Ser Pro 

Arg 

TOG GGC ATT GAG TOG GGC AAC GAG AAG AAC GAG TCC AAG GCG GTG AGT 
Ser Gly He Glu Ser Gly Asn Glu Lys Asn Glu Cys Lys Ala Val Ser 



TOG GGG GGA AGT TCC TOG TOC TCC AGT CCG CGT TCC AGT GTG GAT GAT GCG CTC 
Ser Gly Gly Ser Ser Ser Cys Ser Ser Pro Arg Ser Ser Val Asp Asp Ala Leu 

eio 



GAC TGC AQC GAT GOC GOC GOC AAT CAC AAT CAG GTG GTG CAG CAT CCG CAG CTC 
Asp Cys Ser Asp Ala Ala Ala Asn His Asn Gin Val Val Gin His Pro Gin Leu 



AGT GTG GTC TCC GTC TCA CCA GTT COC TOG COC CAG CCC TCC ACC AGC AGC CAT 
Ser Val Val Ser Val Ser Pro Val Arg Ser Pro Gin Pro Ser Thr Ser Ser His 



CTC AAG CGA CAG ATT GTG GAG GAT ATC 
Leu Lys Arg Gin He Val Glu Asp MET 

2972 

CCC CCT CTC TAC GAT ACC AAC TOG CTC 
Pro Pro Leu Tyr Asp Thr Asn Ser Leu 



COC GTC CTC AAG CGC GTG CTC CAG GCT 
Pro Val Leu Lys Arg Val Leu Gin Ala 

364 

ATC GAC GAG GCC TAC AAG CCC CAC AAG 
MET Asp Glu Ala Tyr Lys Pro His Lys 



AAA TTC COG GCC CTC COG CAT CGC GAG TTC GAG ACC CCC GAG GCG GAT OCC AGC 
Lys Pho Arg Ala Lou Arg His Arg Glu Phe Glu Thr Ala Glu Ala Asp Ala S«r 



AGT TCC ACT TOC CGC TOG AAC AGC CTC AGT GOC GOC AGT CCG CGG CAG AGC CCA 
Ser Ser Thr Ser Gly Ser Asn Ser Leu Ser Ala Gly Ser Pro Arg Gin Ser Pro 

918 

GTC CCG AAC AGT GTG GCC ACG COC COG CCA TOG GOC GOC AGC GOC GOC GCA GGT 
Val Pro Asn Ser Val Ala Thr Pro Pro Pro Ser Ala Ala Ser Ala Ala Ala Gly 



AAT OCC GCC CAG AGC CAG CTC CAC ATC CAC CTC ACC COC AOC AGC COC AAC GCC 
Asn Pro Ala Gin Ser Gin Leu His MET His Leu Thr Arg Ser Ser Pro Lys Ala 
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3242 

TCG ATG GCC AGC TCG CAC TCG GTG CTG GCC AAG TCT CTC ATG GCC GAG COG COT 
Ser MET Ala Sor Ser His Ser Val Lou Aia Lys Ser Lou MET Ala Glu Pro Arc 

972 

" ATG~ACG^CCC~GAG~CAG~ATG -AfcG-CGC~AGC-GAT-AT7~ATG-GAA-AAS™T^ — 

MET Thr Pro Glu Gin MET Lys Arq Sar Asp lie Ilo Gin Asn Tyr Lou Lys Arq 



GAG AAC AGC ACA OCA GCC AGC AGC ACC ACC AAT G3C GTG GGC AAC COZ AGT CCC 
Glu Asn Sor Thr Ala Ala Sor Ser Thr Thr Asn Cly Val Gly Asn Arq Sor Pre 



AGC AGC AGC TCC ACA CCG CCG CCG TCG GCC GTC CAG AAT CAG CAG CGT TOG GGC 
Sor Sor Sor Sor Thr Pro Pro Pro Ser Ala Val Gin Asn Gin Gin Arq Tra Giy 

1026 

AGC AGC TCG GTG ATC ACC ACC ACC TGC CAG CAG CGC CAG CAG TCC GTG TCG CCG 
Sor Sor Sor Val Ilo Thr Thr Thr Cys Gin Gin Arq Gin Gin Sor Val Sor Pre 

3512 

CAC AGC AAC GGT TCC AGC TCC AGT TCG AGC TCT AGC TCC AGC TCC AGT TCG TCA 
His Sor Asn Gly Ser Sor Ser Ser Sor Sor Ser Ser Ser Ser Ser Sor Sor Sor 



TCC TCC TCC ACA TCC TCC AAC TOT AGC TCC AGC TCG GCC AGC AGC TGC CAG TAT 
Sor Sor Sor Thr Sor Sor Asn Cys Sor Ser Ser Sor Ala Ser Sor Cys Gin Tyr 

1080 

TTC CAG TCG CCG CAC TCC ACC AGC AAC GGC ACC AGT GCA CCG GOG AGC TCC AGT 
Phe Gin Sor Pro His Sor Thr Ser Asn Giy Thr Sor Aia Pro Aia Sor Ser Ser 



TCG GGA TCG AAC AGC GCC ACG CCC CTC CTG GAA CTG CAG GTG GAC ATT GCT GAC 
Ser Gly Sor Asn Ser Ala Thr Pro Lou Leu Glu Leu Gin Val Asp He Ala Asp 



TCG GCG CAG CCT CTC AAT TTC TOC AAG AAA TOG COC ACG COG CCG CCC AGC AAG 
Ser Ala Gin Pro Leu Asn Leu Ser Lys Lys Ser Pro Thr Pro Pro Pro Ser Lys 

1134 

3782 

CTG CAC GCT CTG GTG GCC GCC GCC AAT GCC GTT CAA AGG TAT CCC ACA TTG TCC 
Lou His Ala Leu Val Ala Aia Ala Asn Ala Val Gin Arq Tyr Pro Thr Lou Ser 



GCC GAC GTC ACA GTG ACA GCC TCC AAT GGC GGG TCC TCC GTC GGC GGC GGC GAG 
Ala Asp Val Thr Val Thr Ala Ser Asn Gly Gly Ser Ser Val Gly Gly Giy Glu 



TCC GGC CGC CAG CAG CAG TOC GOC GGC GAG TGT GGG CTC CCC CAA TCC GGG CCT 
Sor Gly Arq Gin Gin Gin Sor Ala Giy Glu Cys Gly Lou Pro Gin Ser Gly Pro 

1186 

GAG CGC CGC OGT GCA CAA GGT AAT GCT GGA GGC GTA AGA GCG GGA GGA GGT AGG. 
Glu Arq Arq Arq Ala Gin Gly Asn Aia Gly Gly Val Arq Ala Giy Gly Giy Arq 



TGG TTT TAC GCG GAG AAG TGG GAG AGA CAG AGA CTG GGA GTG GCA GTT CAG CGA 
Trp Phe Tyr Aia Glu Lys Trp Glu Arq Gin Arq Lou Gly Val Ala Val Gin Arq 

4052 

AGC AGG AAG CAG GAT CAC TTG GAG CGG COG GAG TTG AAT TAA 
Ser Arq Lys Gin Asp His Leu Glu Arq Arq Glu Lou Asn 

1237 
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4052 

AGC AGG AAG CAG GAT CAC 7Z0 GAG COG CGG GAG 77G AAT TAA 
Sar Arq Lya Gin Asp Ris Lou Glu Art? Arg Giu Lou Asn . 

1237 

-ATrATTTrAeaVTTTAA^ 

7AATATTTAAAGCAACAACAAACAAAACAACTACAAGTTA 
4234 

AACAAAAAACOCAAGCTTGAATGGTATTACAAAAGAAAAAGAAAAACAGAAAAAAT^ 
3CAGTTAAACTTTAACGTAGCAAGAAACCAACAAACCCAAGGCA 

4 374 m 

TTCAGCTGCTACCGAAAACOCCOCTCACCTCCCCCCCACCCAACC^^ 

CGACCCCTGAITGTTTTATAAGTTTTAAGCTCT^ 
4514 

TTAGOGCTTTAGTTGTAGTTOGAGCAAAACTACT 
TATTAT TATTAAA TTTTTAAATACCT^ 

| ^ poly A sitQ cOm4927 and cDm4928 

4654 _ 

AAGCAGAATCAAGTGCAGTTTGCAACAAATTTTAAC 

ATTTAACTAATTTTATGAATTTGTTATAGTTTTCCAOCCTTCTATAGATCT^ 
4794 

VrCCGTATTCCTGATTTCTCCTTTOGCACAAAG^ 

TGTTTTCTAAITTTAAAACTACCACAAAAATACGATTAAAATATAGAC^ 
^934 

ATGCTTAAAGTTTTAGCAGCAAGCAGTAAAACGACGATGAA^ 

rrGTGTACATAGTTAAATCrrTAAATTAAACACAAAA^ 
5074 

tTMlX^GAAACCTATTXTTTAAAACATTX:^ 

AAAAAAAAATGAACAAACAAAAAT TATGAT ATAT TATTTAAAAGTAAAGTA TACATTTACATTACAGAAA 
5214 

AACAAAAGAGAAAACTTGCGGT AGCAACAAAACTAT TATA TTAATTACATTTT AATT A7GCTG TACT ATT 

ATGATTATTAATTATTAT«1TMTTMTC^ 
5354 

GCAAAAACC£X2&A&AAAAAAAACAAAAAACAAGCAAAAAAT 

putativo polyaaonylatlon signal for long transcripts 
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C»CGC5TTTGGAGTCAACCTCr:X^ 



ASS&CCOGAACEAACACAAC^^^ 



CTCTGC^^TirrSIZilTGCTGGT^ 

1 ► ntlNA stan site 

TXrAGAGCAAGAAAAOCGCOCGAAAAACC^ 



AGAGCAGAGTCAACAGCAACAAiC^TCAATAG^^ 

+211 

TGAGATCAMCTOVAGTGTGCft^^ 

284 

GTGTCATTTCGAAGCCAAAAAGCAAAATCTCTAATTCAAAT ATG GTT XT GCA ATG CAA 

MET Val Cys Ala MET C^r. 

302 

GAG GTT GCT GCT GTG CAG CAT CAG CAG CAG CAA CAG CAA CTC CAG TTG CCC CAG 
Giu Val Ala Ala Val Gin His Gin Gin Gin Gin Gin Gin Lau Gin Lou Pro Gir. 

24 

CAG CAA CAG CAG CAG CAG CAG ACA ACA CAG CAG CAA CAT OCA ACA ACG ATA GTG 

Gin Gin Gin Gin Gin Gin Gin Thr Thr Gin Gin Gin His Ala Thr Thr lie Vai 



CTG CTG ACG GGC AAT GGC GOC GGT AAT CTG CAC ATT GTC GCC ACA CCG CAA CAG 
Lou Lou Thr Giy Asn Gly Gly Giy Asn Lou His Ilo Val Ala Thr Pro Gin Gir. 



CAT CAG CCG ATG CAT CAG CTC CAC CAT CAG CAT CAG CAT CAG CAT CAG CAC CAG 
His Gin Pro MET His Gin Lou His His Gin His Gin His Gin his Gin His Gir. 

76 

CAG CAG GCC AAG AGC CAA CAG CTG AAG CAA CAA CAC TCG GCG CTG GTC AAG TTG 
Gir. Gin Ala Lys Sor Gin Gin Lou Lys Gin Gin His Sor Ala Leu Vai Lys Lou 

572 

CTG GAG TCG GCG CCC ATC AAG CAG CAA CAG CAG ACG CCC AAG CAA ATT GTT TAC 
Leu Giu Sor Ala Pro Ilo Lys Gin Gin Gin Gin Thr Pro Lys Gin lie Vai Tyr 



CTG CAG CAG CAG CAG CAG CAA CCG CAA CGC AAA AGA CTG AAA AAC GAA OZA CCA 
Lou Gin Gin Gin Gin Gin Gin Pro Gin Arg Lys Anj Lou Lys Asn Giu Ala Aid 

132 

ATC GTA CAA CAG CAA CAA CAA ACA CCT OCA ACA CTA GTA AAG ACA ACA ACC ACC 
II Val Gin Gin Gin Gxn Gin Thr Pro Ala Thr Lou Val Lys Thr Thr tr.r Thr 



AGC AAC AGC AAC AGC AAC AAC ACC CAG ACA ACA AAT AGT ATT ACT CAG CAG CAA 
Sor Asn Sor Asn Sor Asn Asn Thr Gin Thr Thr Asn Sor I^e Sor Gin Gin Gir. 
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AGC AAC AGC AAC AGC AAC AAC AOC CAG ACA ACA AAT AGT ATT AGT CAG CAG CAA 
Ser Asn Ser Asn Sor Asn Asn Thr Gin Thr Thr Asn Ser lie Ser Gin Gin Gin 



CAG GAG CAT CAG ATT- CTG TTG -CAG CAC CAG CAG CCA GCC-GCG OCA. OCA. ACA CCA 
Gin Gin His Gin II© val Leu Gin His Gin Gin ?rc Aia Ala Ala Aia Thr Pro 

186 

842 

AAG CCA TGT GCC GAT CTG AOC GCC AAA AAT CAC AGC GAG TCG GGC ATC GAC GAG 
Lys Pro Cys Aia Asp Leu Ser Ala Lys Asn Asp Ser Giu Ser Giy He Asp Giu 



GAC TOC CCC AAC AGC GAT GAG GAT TOC CCC AAT GOC AAC CCG GCG GGC ACA TCG 
Asp Cys Pro Asn Ser Asp Giu Asp Cys Pro Asn Ala Asn Pro Aia Giy Thr Ser 



CTC GAG GAC AGC AGC TAC GAG CAG TAT CAG TGC CCC TGG AAG AAG ATA CCC TAT 
Leu Glu Asp Ser Ser Tyr Glu Gin Tyr Gin Cys Pro Trp Lys Lys lie Arq Tyr 

240 

GOG CGT GAG CTC CTC AAG CAG COC GAG TTG GAG CAG CAG CAG ACC ACC CGA GGC 
Ala An? Glu Leu Leu Lys Gin Arq Glu Leu Giu Gin Gin Gin Thr Thr Giy Giy 



AGC AAC GCG CAG CAG CAA GTC GAG GCG AAG CCA GCT OCA ATA CCC ACC AGC AAC 
Ser Asn Ala Gin Gin Gin Val Giu Ala Lys Pro Ala Ala He Pro Thr Ser Asn 

1112 

ATC AAG CAG CTC CAC TGT GAT AGT CCC TTT TOG GCG CAG ACC CAC AAG GAA ATC 
lie Lys Gin Leu His Cys Asp Ser Pro Phe Ser Aia Gin Thr His Lys Giu He 

294 

GCC AAT CTC CTG CGC CAA CAG TOC CAG CAA CAA CAG GTT GTG GCC ACG CAG CAG 
Ala Asn Leu Leu An? Gin Gin Ser Gin Gin Gin Gin Vai Val Aia Thr Gin Gin 



CAG CAG CAA CAG CAG CAG CAG CAC CAG CAC CAG CAA CAA CGA AGG GAT AGC TCC 
Gin Gin Gin Gin Gin Gin Gin His Gin His Gin Gin Gin Arq Arq Asp Ser Ser 

GAC AGC AAC TOC TCG CTG ATC AGC AAC TCG AGC AAC TCC AGT GCG CCC AAT TGT 
Asp Ser Asn Cys Ser Leu MET Ser Asn Ser Ser Asn Ser Ser Ala Cly Asn Cys 

348 

TOC AOC TOC AAC GCT COC GAC GAC CAG CAG CTG GAG GAG ATG GAC GAG GCC CAC 
Cys Thr Cys Asn Ala Giy Asp Asp Gin Gin Leu Giu Giu MET Asp Giu Aia His 

1382 

GAT TCG GGC TOC CAC GAT GAA CTT TOC GAG CAG CAT CAC CAG CGA CTG GAC TCC 
Asp Ser Giy Cys Asp Asp Giu Leu Cys Giu Gin His His Gin Arq Leu Asp Ser 



TCC CAA CTG AAT TAC CTG TOC CAG AAG TTC GAT GAG AAA CTG GAC ACG GCG CTG 
Ser Gin Leu Asn Tyr Leu Cvs Gin Lys Phe Asp Glu Lys Leu Asp Thr Ala Leu 

402 

AGC AAC AOC AOC GCC AAC ACC GOG AOG AAC ACG CCA OCT GTA ACA GCT AAC GAA 
Ser Asn Ser Ser Aia Asn Thr Giy Arq Asn Thr Pro Aia Vai Thr Aia Asn Giu 

1544 

GAT CCC GAT qcaqqttraq 
Asp Ala Asp 
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WHAT IS CLAIMED IS: 
~~T. ~Arf isolated recombirianir^nucl^ic "acid wh ich ; 

upon expression, is capable of coding for other than a 
native vertebrate steroid receptor or fragment thereof, 
said nucleic acid comprising a segment having a sequence 
substantially homologous to a coding region of domains A, 
B, D, E or F from an insect steroid receptor super family 
member gene having substantial homology to a steroid 
binding domain. 

2. An isolated recombinant nucleic acid of 

Claim 1, wherein said insect steroid receptor superfamily 
member is EcR, DHR3, E75A or E75B. 

3. An isolated recombinant nucleic acid of 
Claim 1, wherein said nucleic acid encodes a polypeptide 
capable of binding to a ligand for an insect steroid 
receptor superfamily member. 

4. An isolated recombinant nucleic acid of 
Claim 1, wherein said nucleic acid is capable of 
hybridizing to an insect steroid receptor superfamily 
member gene segment under selective hybridization 
conditions • 

5. An isolated recombinant nucleic acid of 
Claim 4, wherein said selective hybridization conditions 
are stringent hybridization conditions. 

6. A cell transformed with an isolated recombinant 
nucleic acid of Claim 1. 

7. An isolated recombinant nucleic acid having a 
sequence exhibiting identify over 20 nucleotides of a 
coding segment of an insect steroid receptor superfamily 
member having steroid binding domain homology. 

8. An isolated recombinant nucleic acid of 
Claim 7, wherein said nucleic acid encodes a polypeptide 
which binds to a control element responsive to a ligand 
of an insect steroid receptor superfamily. 

9. A cell transformed with an isolated recombinant 
nucleic acid of Claim 7. 
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10. An isolated recombinant nucleic acid comprising 
a DNA sequence capable ^*bindiiig~to an~insect T^tefoTd 
receptor superfamily member other than 20-OH ecdysone 
receptor. 

11. An isolated recombinant nucleic acid of 
Claim 10 , wherein said insect steroid receptor 
superfamily member is DHR3 , E75A or E75B. 

12. An isolated recombinant nucleic acid of 
Claim 10, wherein said DNA sequence promotes 
transcription of an operably linked sequence in response 
to binding by said insect steroid receptor superfamily 
member. 

13. An isolated recombinant nucleic acid of 

Claim 10 , wherein said DNA sequence is operably linked to 
a DNA sequence encoding a polypeptide. 

14. An isolated recombinant nucleic acid of 
Claim 10 , wherein said isolated recombinant nucleic acid 
is an expression vector. 

15. A cell transformed with an isolated recombinant 
nucleic acid of Claim 10. 

16. A cell of Claim 15, wherein said cell also 
comprises said insect steroid receptor superfamily 
member. 

17. A recombinant nucleic acid comprising: 

a control element responsive to a ligand of an 
insect steroid receptor superfamily member 
ligand responsive control element; 

a non-heat shock promoter sequence; and 

a sequence comprising a reporter gene. 

18. A recombinant nucleic acid of Claim 17, wherein 
said non-heat shock promoter sequence is an alcohol 
dehydrogenase promoter. 

19. A cell transformed with a recombinant nucleic 
acid of Claim 17. 

20. A cell of Claim 19, wherein said cell is a 
mammalian cell. 
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21. A recombinant nucleic acid of Claim 17, 

comprising~additi~onal~ ~copies~of "control" Tements 

responsive to a ligand of an insect steroid receptor 
super family member. 

22. A method for monitoring expression of a 
reporter gene comprising the steps of: expressing a 
recombinant nucleic acid of Claim 17 , wherein no 
radioactive reagent is used in said monitoring of said 
reporter gene. 

23. A polypeptide comprising an insect steroid 
receptor superfamily member or fragment thereof, wherein 
said polypeptide is substantially free of naturally- 
associated insect cell components and exhibits a 
biological activity characteristic of an insect steroid 
receptor superfamily member with a steroid binding 
domain. 

24. A polypeptide of Claim 23, wherein said insect 
steroid receptor superfamily member is selected from the 
group consisting of EcR, DHR3, E75A and E75B. 

25. A polypeptide of Claim 23, wherein said insect 
steroid receptor superfamily member or fragment thereof 
also comprises a DNA binding domain. 

26. A polypeptide of Claim 23, wherein said insect 
is Drosophila melanogaster . 

27. A polypeptide of Claim 23, which is capable of 
binding to a hormone analogue selected from the group 
consisting of an insect hormone and an insect hormone 
agonist. 

28. A polypeptide of Claim 27, wherein said insect 
hormone is an ecdysteroid. 

29. A polypeptide of Claim 27, wherein said insect 
hormone is 20-OH ecdysone. 

30. A polypeptide of Claim 23, which is capable of 
binding to a DNA control element responsive to an insect 
horm ne. 

31. A polypeptide of Claim 30, wherein said 
polypeptide comprises a zinc-finger domain. 
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32. A polypeptide of Claim 30, wherein said insect 

hormone responsiW~DNA~co^ 

to a transcription unit which is responsive to said 
binding. 

5 33. A polypeptide of Claim 32, wherein said insect 

hormone responsive DNA control element is upstream from 
said transcription unit. 

34. A polypeptide of Claim 23 fused to a second 

polypeptide. 

10 35. A polypeptide of Claim 34 r wherein said second 

polypeptide is a heterologous polypeptide. 
36. A polypeptide of Claim 35, wherein said 

heterologous polypeptide comprises a second steroid 
receptor superfamily member. 

15 37. A polypeptide of Claim 23, wherein said 

fragment has a sequence substantially homologous to a 
consensus El region sequence. 

38. A polypeptide of Claim 23, wherein said 
fragment has a sequence substantially homologous to a 

20 consensus £2 region sequence. 

39. A polypeptide of Claim 23, wherein said 
fragment has a sequence substantially homologous to a 
consensus E3 region sequence. 

40. A polypeptide of Claim 23, wherein said 
25 fragment has a sequence comprising: 

a segment at least about 25% homologous to a 

consensus El region sequence; 
a segment at least about 30% homologous to a 

consensus E2 region sequence; and 
30 a segment at least about 30% homologous to a 

consensus E3 region sequence. 

41. A composition of matter comprising a 
polypeptide of Claim 23. 

42. A cell comprising a polypeptide of Claim 23. 
35 43. A cell of Claim 42, wherein said cell is a 

human cell. 
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44. An antibody or binding fragment thereof 

e^ifiatii^~b~i^^ — 

Claim 23, wherein said binding specificity is directed to 
an epitope characteristic of an insect steroid receptor 
superfamily member. 

45. A method for selecting DNA sequences capable of 
being specifically bound by an insect steroid receptor 
superfamily member , said method comprising the steps of: 

screening DNA sequences for binding to a polypeptide 

of Claim 23; and 
selecting said DNA sequences exhibiting said 

binding. 

46. A method of Claim 45, wherein said DNA sequence 
is operably linked to a gene selected from the group 
consisting of EcR, DHR3, E74 and E75 genes. 

47. A method for selecting ligands specific for 
binding to a hormone binding domain of an insect steroid 
receptor superfamily member, said method comprising the 
steps of: 

screening compounds for binding to one or more 

superfamily members; and 
selecting compounds exhibiting specific binding to 

the members. 

48. A method of Claim 47, wherein said ligand is an 
ecdysteroid. 

49. A methpd of Claim 47, wherein said ligand is a 
20-OH ecdysone antagonist. 

50. A method for modulating insect physiology or 
development comprising the steps of: 

screening compounds for binding to an insect steroid 

receptor superfamily member; 
selecting said compounds exhibiting said binding; 

and 

administering to an insect said ligand. 

51. Am thod of Claim 50, wherein said modulating 
is lethal to said insect. 
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52. A fusion polypeptide comprising a hormone 
binding domairT of an insect" steroid* receptor sup rf ami ly 
member and a second polypeptide* 

53. A fusion polypeptide of Claim 52 , wherein said 
second polypeptide comprises a DNA binding domain from a 
second steroid receptor superfamily receptor member. 

54. A nucleic acid encoding a fusion polypeptide of 
Claim 52. 

55. A method for selecting ligands specific for 
binding to a ligand binding domain of an insect steroid 
receptor superfamily member, said method comprising the 
steps of: 

combining: 

(i) a fusion polypeptide of Claim 52, wherein 

said fusion polypeptide comprises 
said ligand binding domain 
functionally linked to a DNA binding 
domain of a second steroid receptor 
superfamily member; and 

(ii) a second nucleic acid sequence encoding a 

second polypeptide, wherein 
expression of said second nucleic 
acid sequence is responsive to 
binding by said DNA binding domain; 
and 

screening compounds for an activity of inducing 
expression of said second polypeptide; and 
selecting said compounds. 

56. A method of Claim 55, wherein said combining 
occurs within a cell. 

57. A method of Claim 56, wherein said combining 
step results from expression upon transformation of said 
cell with a nucleic acid encoding said fusion 
polypeptide. 

58. A method for modulating insect physiology 
comprising the step of: 
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administ ring to an insect a ligand sel cted by a 
method of Claim^57 ^ 

59. A method of Claim 55 , wherein said binding 
domain is selected from binding domains of insect steroid 

5 receptor superf amily members selected from the group 

consisting of EcR, DHR3, E75A and E75B. 

60. A method of Claim 58, wherein said ligand is 
lethal to said insect. 

61. An isolated receptor control element non- 
10 responsive to 20-OH ecdysone comprising a DNA segment 

capable of binding to an insect steroid receptor 

superf amily member and capable of controlling expression 

of an operably linked gene. 

62. An isolated receptor control element of 

15 Claim 61, wherein said operably linked gene is within 

about 50 kb of said control element. 

63. An isolated receptor control element of 
Claim 61, wherein said insect steroid receptor 
superf amily member is DHR3, E75A or E75B. 

20 64. A method for producing a polypeptide comprising 

the steps of: 

selecting a cell which is substantially insensitive 
to exposure to an insect steroid receptor 
superf amily ligand; 
25 introducing into said cell: 

(i) a receptor for said ligand; and 

(ii) a nucleic acid sequence encoding said 

polypeptide, said nucleic acid 
sequence operably linked to a 
30 control element responsive to 

presence of said selected 
ligand, wherein a transformed 
cell is produced; and 
exposing said transformed cell to said ligand. 
35 65. A method of Claim 64, wherein said cell is a 

mammalian cell. 
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66. A method of Claim 64 , further comprising the 
st^~of"inteoducing'said cell into an "intact organismT. 

67. A method of Claim 64, wherein said cell is a 
plant cell, 

68. A method of Claim 64 , wherein said ligand is 
20 -OH ecdysone. 



b:\5490A-76.PCT 



WO 91/13167 



PCT/US91/01189 



EcR ORF 




Copia LTR 



SUBSTITUTE SHEET 



WO 91/13167 



PCT/US91/01189 



2/3 



Sail 



BamBZ/Bglll Scal/SacZ 



UBX leader and AUG 




SV40 
splice 
and poly A 



Bgal 
ORF 



F/&-2 



SUBSTITUTE SHEET 



WO 91/13167 



PCT/US91/01189 



3/3 



i 



EcR ORF 




SV40 
splice/ 
poly A 



F/G.—3. 

4 



SUBSTITUTE SHEET 



INTERNATI NAL SEARCH REPORT 



I. CLASSIFlCATIOt ' SUBJECT MATTER "t s it c-us.r>ni,an s.no i 



FCT" T JS91 0115° 



According to International Patent Classification UPC) or «o uctn Notional Cass ?.ca: on a~ • ,PQ 

-U-S.- CL.-:_435/-69...1.;_530.-i38.7-;._536/.2.7 Ll__ 

IPC(5>: C12P 21 -'06: A61K 35 'U: C07H 15 12 

I FIELDS SEARCHEO " 

V.i««it,r«! s "Ci"*****' »:«on Sear" ( '* a 3 " 



Classification Svstem 



C'asSi*"ic.n!ion S#ticois 



L'.S. CL. 



435/69.1; 530/38.7; 536/27 



CocuniMnt.ii.cn Searc^d otrer t:\in W.r..r.!^ DccuT»c"!:ition 
•o t^e E«lent sue* Documents ar.* Ir.cSjjoo n «*e F-csds 5r»arcr*eo • 



Chemical Abstracts, Biological Abstracts 



III. DOCUMENTS CONSIDERED TO BE RELEVANT 9 



Category * 



Ctation ol Document. " *ith indication. *m*re ar-nroorote. or :re 'oieo-" passages « 



Relevant to Claim No 13 



European Journal of Biochemistry, 
volume 189. issued 1990. M. Strangmann- 
Diekmann et ai . "Affinity labelling of 
Partially Purified Ecdysteroid Receptor 
with Bromoacetyiated 20-OH-ecdysone 
Derivative", pages 137-143, see pages 
138 and 141-143. 

Molecular and Cellular Endocrinology, 
volume 57. issued 1988. >J. I.ehmann et al 
E'-.;v steroid Receptors of the blowfly. 
Calliphora vicina: Partial Purification 
and 'Characterization of Ecdysteroid 
Bindina 1 ' . pages 239-249. see pages 
:A2. 244 and 246-248. 

T. Maniatis et al . "Molecular Cloning. 
A Laboratory Manual" published 1989. 
Cold Spring Harbor Laboratory Press 
i'X,T.) . see pages 8.60-8.63. 



23-4 3 



^ j - -i J 



7.8 



* Special caieqones ol cited documents: 10 
"A" document defining the general state of the art which is not 
considered to be ol particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which mav throw doubts on priority claim(s) or 
which is cited to establish the publication date ot another 
citation or other special reason (as specified) 

"0* document referring to an oral disclosure, use. emibilion or 
other means 

"P" document puPhshed pnor to the international filing aatc but 
later than the prior. t/ dale ciaimea 



Liter document oubhshed alter r»e international filmo aate 
or nnorit/ date and not m conflict Attn tre aooncat.on out 
ctod to understand the principle or theory underlying the 
invention 

document of particular relevance: tr»e claimed invention 
cannot oe consioerca novei or cannot do considered to 
mvoive an inventive step 

document ol particular relevance: the claimed invent-on 
cannot oe consiacred to mvoivi? an inventive steo wnen t**e 
document *% conjoined with one or more other sucn oocj- 
nients. sucn compination oemg op»»ous to a person silica 
• n tne art. 

document memoer o f the same patent 'amnv 



IV. CERTIFICATION 



Date of the Actual Completion ol me International Search 



Date ol Mailing ot this International Search Report 



31 Mav 1991 



27 m mi 



International Searching Author*!/ 



ISA/US 



Signature ol Authorised 0;t cer /"j . 

Deborah Crouch 



FormPCL:SA2i0iMCcrasrie«i)(Aev M B.* 



PCTTSQ1 ^JJjS 





— . — — 1 i^V 

FURTHER INFORM ION CONTINUED FROM THE SECONO SHEET 




t 
! 




• 


i 

i 
S 


i 
1 

i ! 

1 




V □ OBSERVATIONS WHERE CERTAIN CLAIMS WERE FOUND UNSEARCHABLE 1 


This international search report nas not been established in resoect ol certain claims under Article 17(2) (a) for the following reasons: 
Claim numbers . because they relate to suoject matter i- not required to be searched by this Authority, namely: 

2. Q Claim numbers . because they relate to oarts of the international application that do not comply with the preserved require* 

ments to such an evtent that no meaningful international search can be carried out *•». specifically: 

3. n Claim numbers . because mev are deoendent claims not drafted in accordance with the second and third sentences of 

PCT Rule 6.4(a). 


VI. P£ OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING » 


This International Searching Authority found multiple inventions in this international application as follows: 

See attachment 

As all required additional search fees were timely paid by the applicant, this international search report covers all searchable claims 
of the international application. 

2.0 As only some of the required additional search fees were timely paid by the applicant, this international searcn report covers only 
those claims of the international application for which tees were paid, specifically claims: 

Claims 1-5, 7-8, 23-43 telephone practice 

3.Q No reouired additional search fees were timely paid by the applicant. Consequently, this international search report is restricted to 
the invention first mentioned in the claims: it is covered by claim numoers: 

4 Q As all searchable claims could be searched without effort justifying an additional fee. the International Searching Authority did not 
*— ' invite payment ol any additional lee. 

Remark on Protest 

n The additional search tees were accompanied by applicant's protest. 

n No protest accompanied the payment ol additional search lees. 



Form PCTrtSA/210 (suppiemsriaj meet (2) (Rev. I t-B7> 



LACK OF UNITY OF INVENTION 



PCT/US91/01189 

I. Claims 1-5 and 7-8 ; insect steroxd receptor DNA; classified 
in 536/27 

II. Claims 6 and 9; cells transformed with insect steroid 
receptor DNA; classified in 435/69.1 

III. Claims 10-14; DNA sequence binding to insect steroid 
receptor DNA; classified in 536/27 

IV. Claims 15 and 16; cells transformed with DNA sequence 
binding to insect steroid receptor; classified in 435/69.1 

V. Claims 17-21; DNA construct; classified in 536/27 * 

VI. Claim 22; method for monitoring expression; classified in 
435/6 

VII. Claims 23-43; a polypeptide; classified in 530/350 

VIII. Claim 44; antibody; classified in 350/387 

IX. Claims 45-49; method to select DNA sequence which binds 
insect steroid receptor; classified in 435/6 

X. Claims 50, 51, 58 and 60; method to modulate insect 
physiology; classified in 424/405 

XI. Claims 52 and 53; fusion polypeptide; classified in 530/350 

XII. Claim 54; nucleic acid; classified in 536/27 

XIII. Claim 55-57 and 59; method for selecting ligands; classified 
in 436/501 

XIV. Claims 61-63; receptor control elements; classified in 
536/27 

XV. Claims 64-68; method to produce a polypeptide; classified in 
435/69. 1 



Groups I, II, III, IV, V. VII, VIII, XI, XII and XIV are distinct 
products. Groups VI, IX, X, XIII and XV are distinct methods. The 
claims of these groups are drawn to distinct inventions, which 
are not linked to form a single inventive concept. PCT rules 13. 1 
and 13.2 does not allow an application to contain more than one 
inventive concept. All other products, methods of making and 
methods of using are additional invnetions. 



